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Abstract. In 1952, Heinrich Scholz published a question in the Journal of Sym- 
bolic Logic asking for a characterization of spectra, i.e., sets of natural numbers 
that are the cardinalities of finite models of first order sentences. Giinter Asser 
asked whether the complement of a spectrum is always a spectrum. These innocent 
questions turned out to be seminal for the development of finite model theory and 
descriptive complexity. In this paper we survey developments over the last 50-odd 
years pertaining to the spectrum problem. Our presentation follows conceptual devel- 
opments rather than the chronological order. Originally a number theoretic problem, 
it has been approached in terms of recursion theory, resource bounded complexity 
theory, classification by complexity of the defining sentences, and finally in terms of 
structural graph theory. Although Scholz' question was answered in various ways, 
Asser's question remains open. One appendix paraphrases the contents of several 
early and not easily accesible papers by G. Asser, A. Mostowski, J. Bennett and S. 
Mo. Another appendix contains a compendium of questions and conjectures which 
remain open. 
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Sganarelle: Ah! Monsieur, e'est un spectre: 

je le reconnais au marcher. 

Dom Juan: Spectre, fantome, ou diable, 

je veux voir ce que e'est. 

J.B. Poquclin, dit Molicrc, Dom Juan, Acte V, scene V 

§1. Introduction. At the Annual Symposium of the European As- 
sociation of Computer Science Logic, CSL'05, held in Oxford in 2005, 
Arnaud Durand, Etienne Grandjean and Malika More organized a special 
workshop dedicated to the spectrum problem. The workshop speakers 
and the title of their talks where 

• Annie Chateau (UQAM, Montreal) 

The Ultra- Weak Ash Conjecture is Equivalent to the Spectrum Con- 
jecture, and Some Relative Results 

• Mor Doron (Hebrew University, Jerusalem). 

Weakly Decomposable Classes and Their Spectra (joint work with 
S. Shelah), 

• Aaron Hunter (Simon Fraser University, Burnaby). 

Closure Results for First-Order Spectra: The Model Theoretic Ap- 
proach 

• Neil Immerman (University of Massachusetts, Amherst) 
Recent Progress in Descriptive Complexity 

• Neil Jones (University of Copenhagen, Copenhagen) 
Some remarks on the spectrum problem 

• Johann A. Makowsky (Technion-Israel Institute of Technology, Haifa) 
50 years of the spectrum problem 

The organizers and speakers then decided to use the occasion to expand 
the survey talk given by J. A. Makowsky into the present survey paper, 
rather than publish the talks. 

Acknowledgements. We would like to thank A. Chateau, M. Doron, A. 
Esbelin, R. Fagin, E. Fischer, E. Grandjean, A. Hunter, N. Immerman 
and S. Shelah for their fruitful comments which helped our preparation 
of this survey. 
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§2. The Emergence of the Spectrum Problem. 

2.1. Scholz's problem. In 1952, H. Scholz published an innocent 
question in the Journal of Symbolic Logic [116]: 

1. Ein ungelostes Problem in der symbolischcn Logik. IK sei der 
Pradikatenkalkiil der ersten Stufe mit der Identitat. In IK ist ein Pos- 
tulatensystem BP fiir die Boole'sche Algebra mit einer einzigen zweis- 
telligen Pradikatenvariablen formalisierbar. 9 sei die Konjunktion der 
Postulate von PB. Dann ist 9 fiir endliche m m-zahlig erfullbar genau 
dann, wcnnn cs ein n > gibt, sodafi m — 2™. 

Hieraus ergibt sich das folgende Problem. H sei ein Ausdruck des 
IK. Untcr dem Spectrum von H soil die Menge der natiirlichcn Zahlcn 
verstanden sein, fiir welche H erfullbar ist. M sei eine beliebige Menge 
von natiirlichcn Zahlen. Gesucht ist eine hinreichcnde [hinrerichcndc] 
und notwendige Bedingung dafiir, dafi es ein H gibt, sodaB M das 
Spectrum von H ist. 
(Received September 19, 1951). 

In English: 

1. An unsolved problem in symbolic logic. Let IK [Idcntitatskalkul] 
be the first order predicate calculus with identity. In IK one can 
formalize an axiom system BP [Boole'sche Postulate] for Boolean al- 
gebras with only one binary relation variable. Let 9 be the conjunction 
of the axioms of BP. Then 9 is satisfiable in a finite domain of m 
elements if and only if there is an n > such that m = 2™. 

From this results the following problem. Let H be an expression of 
IK. We call the set of natural numbers, for which H is satisfiable, the 
spectrum of H. Let M be an arbitrary set of natural numbers. We 
look for a sufficient and necessary condition that ensures that there 
exists an H, such that M is the spectrum of H . 
(Received September 19, 1951). 

This question inaugurated a new column of Problems to be published in 
the Journal of Symbolic Logic and edited by L. Henkin. Other ques- 
tions published in the same issue were authored by G. Kreisel and L. 
Henkin. They deal with a question about interpretations of non-finitist 
proofs dealing with recursive ordinals and the no-counter-example inter- 
pretation (Kreisel) , the provability of formulas asserting the provability or 
independence of provability assertions (Henkin) , and the question whether 
the ordering principle is equivalent to the axiom of choice (Henkin). All 
in all 9 problems were published, the last in 1956. 

The context in which Scholz's question was formulated is given by the 
various completeness and incompleteness results for First Order Logic 
that were the main concern of logicians of the period. An easy con- 
sequence of Godel's classical completeness theorem of 1929 states that 
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validity of first order sentences in all (finite and infinite) structures is re- 
cursively enumerable, whereas Church's and Turing's classical theorems 
state that it is not recursive. In contrast to this, the following was shown 
in 1950 by B. Trakhtenbrot. 

Theorem 2.1 (Trakhtenbrot 50[126]). Validity of first order sentences 
in all finite structures (f-validity) is not recursively enumerable, and hence 
satisfiability of first order sentences in some finite structure (f- satisfiability) 
is not decidable, although it is recursively enumerable. 



Heinrich Scholz, a German philosopher, was born 17. December 1884 
in Berlin and died 30. December 1956 in Munster. He was a student 
of Adolf von Harnack. He studied in Berlin and Erlangen philosophy 
and theology and got his habilitation in 1910 in Bcrin for the sub- 
jects philosophy of religion and systematic theology. He received his 
Ph.D. in 1913 for his thesis Schleiermacher and Goethe. A contri- 
bution to the history of German thought. In 1917 he was appointed 
full professor for philosophy of religion in Breslau (Wroclaw, today 
Poland). In 1919 he moved to Kiel, and from 1928 on he taught in 
Munster. From 1924 till 1928 he studied exact sciences and logic and 
formed in Munster a center for mathematical logic and foundational 
studies, later to be known as the school of Munster. His chair became 
in 1936 the first chair for mathematical logic and the foundations of 
exact sciences. His seminar underwent several administrative meta- 
morphoses that culminated in 1950 in the creation of the Institute 
for mathematical logic and the foundations of exact sciences, which 
he led until his untimely death. Among his pupils and collaborators 
we find W. Ackermann, F. Bachmann, G. Hasenjager, H. Hermes, K. 
Schroter and H. Schweitzer. He was also among the founders of the 
German society bearing the same name (DVMLG). H. Scholz was a 
Platonist, and he considered mathematical logic as the foundation of 
epistemology. He is credited for his discovery of Frege's estate, and 
for making Frege's writing accessible to a wider readership. Together 
with his pupil Hasenjager he authored the monograph Grundziige der 
Mathematischen Logik, published posthumously in 1961. 



Thus, H. Scholz really asked whether one could prove anything mean- 
ingful about f-satisfiablity besides its undecidability. 

2.2. Basic facts and questions. In our notation to be used through- 
out the paper, H. Scholz introduced the following: 

Let r be a vocabulary, i.e., set of relation and function symbols. Let 
4> be a sentence in some logic with equality over a vocabulary r. Unless 
otherwise stated the logic will be first order logic FOL(t). Sometimes we 
shall also discuss second order logic, or a fragment thereof. 
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Definition 2.2. The spectrum SPEC(0) of 4> is the set of finite cardi- 
nalities (viewed as a subset of N), in which <ft has a model. 
We denote by Spec the set of spectra of first-order sentences, i.e., 

Spec = {spec((/>) | 4> is a first-order formula} 

We shall use S, Si to denote spectra. For the definition of spectra it does 
not matter whether we use function symbols or not. So, unless otherwise 
stated, vocabularies will be without function symbols. However, we shall 
allow function symbols when dealing with sentences of special forms. 

Clearly, spec(</>) = if and only if <p is not f-satisfiable. By definition 
of satisfiability is never part of a spectrum. Very often a spectrum is 
finite, cofinite or even of the form N + = N — {0}. 

Question 2.3. Is it decidable whether, for a given 4>, spec(^) = N + ? 

As H. Scholz noted, (the set of) powers of 2 form a spectrum, because 
they are the cardinalities of finite Boolean algebras. Similarly, powers 
of primes form a spectrum, because they are the cardinalities of finite 
fields. For a,b £ N + there are many ways to construct a sentence <f> with 
SPEC(</>) = a + 6N, one of which consists in using one unary function 
symbol. With a moment of reflection, one sees that spectra have the 
following closure properties. 

Proposition 2.4. Let S\ and S 2 be spectra. 

(i) Then S\ U S2, S\ n S2 are also spectra. 

(ii) Let S1 + S2 = {m + n : in G Si,n G S2}. Then S1 + S2 is a spectrum. 

(iii) Let S\ * S2 = {m ■ n : m G Si, n G S2}. Then S± * S2 is a spectrum. 

In the spirit of Question 12.31 we can also ask: 

Question 2.5. Which of the following sets are recursive? The set of 
sentences 4> such that 

(i) spec(</>) is finite, cofinite. 

(ii) SPEc(^) is ultimately periodic. 

(iii) spec(</>) is, for given a, b G N of the form a + bN. 

(iv) spec(0) = S for a given set S C N+. 

We shall answer Questions 12.31 and 12.51 in Section [3.4i 

2.3. Immediate responses to H. Scholz's problem. The first to 
publish a paper in response to H. Scholz's problem was G. Asser [7]. A. 
Robinson's review [112] summarizes it as follows: 

(...) The present paper is concerned with the characterisation of all 
representable sets [=spectra]. A rather intricate necessary and suffi- 
cient condition is stated for arithmetical function X(n) to be the char- 
acteristic function of a representable set. The condition shows that 
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such a function is elementary in the sense of Kalmar. (...) On the 
other hand, the author establishes that there exist non-representable 
sets whose characteristic function is elementary. Examples of rep- 
resentable sets (some of which are by no means obvious) are given 
without proof and the author suggests that further research in this 
field is desirable. 

Asser also noted that his characterization did not establish whether the 

complement of a spectrum is a spectrum. 
About the same time, A. Mostowski [98] also considered the problem. 

H. Curry [29] summarizes Mostowski's paper as follows: 

(...) The author proves that for each function f(n) of a class K of 
functions, which is like the class of primitive functions except that at 
each step all functions are truncated above at n, there is a formula H 
that has a model in a set of n + 1 individuals if and only if f(n) 
f) From this he deduces positive solutions to Scholz's problem in a 
number of special cases. 

It is usually considered that A. Mostowski really proved 

Theorem 2.6. All sets of natural numbers, whose characteristic func- 
tions are in the second level of the Grzegorzcyk Hierarchy £ 2 , are first 
order spectra. 

The detailed definitions and contents of this theorem will be discussed 
in Section HI 

In the last 50 years a steady stream of papers appeared dealing with 
spectra of first order and higher order logics. The problem seems not too 
important at first sight. However, some of these papers had consider- 
able impact on what is now called Finite Model Theory and Descriptive 
Complexity Theory. 

Open Question 1 (Scholz's Problem). Characterize the sets of natu- 
ral numbers that are first order spectra. 

Scholz's Problem, as stated, is rather vague. He asks for a characteri- 
zation of a family of subsets of the natural numbers without specifying, 
what kind of an answer he had in mind. The answer could be in terms 
of number theory, recursion theory, it could be algebraic, or in terms of 
something still to be developed. We shall see in the sequel many solutions 
to Scholz's Problem, but we consider it still open, because further answers 
are still possible. 

The same question can be asked for any logic, in particular second order 
logic SO, or fragments thereof, like monadic second order logic MSOL, 
fixed point logic, etc., as discussed in [36, 84]. 

Open Question 2 (Asser's Problem). Is the complement of a first or- 
der spectrum a first order spectrum ? 
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Here the answer should be yes or no. 

The corresponding problem for SO has a trivial solution. Let 4> £ SO(r) 
with r = {R\, ... , Rk}- An integer n is in spec(^) iff 

n |= 3i?i3i? 2 . . .3R k <f>. 

Then, the complement of spec(^) is easily seen to be the spectrum of the 
SO sentence ~^(3Ri . ■ ■ 3Rk<p)- In passing, note that every SO-spectrum 
is a SO(t) over a language r containing equality only. 

However, for fixed fragments of SO, Asser's Problem remains open. In 
particular 

Open Question 3. Is the complement of a spectrum of an MSOL- 
sentence again a spectrum of an MSOL-sentence? 

2.4. Approaches and themes. In this survey we shall describe the 
various solutions and attempts to solve Scholz's and Asser's problems, and 
the developments these attempts triggered. We shall emphasize more 
the various ways the questions were approached, and focus less on the 
historical order of the papers. 

There are several discernible themes: 

Recursion Theory: The early authors H. Asser and A. Mostowski 
approached the question in the language of the theory of recursive 
functions i.e. they looked for characterization of spectra in terms of 
recursion schemes, or hierarchies of recursive functions. Most promi- 
nently in terms of Kalmar's elementary functions, the Grzegorczyk 
hierarchy and hierarchies of arithmetical predicates, in particular 
rudimentary relations. This line of thought culminates in 1962 in 
the thesis of J. Bennett [90. Although G. Asser already character- 
ized first order spectra in such terms, his characterization was not 
considered satisfactory even by himself, because it did not use stan- 
dard terms and was not useful in proving that a given set of integers 
is (or not) a spectrum. We shall discuss the recursion theoretic ap- 
proach in detail in Section [H 

Complexity Theory: In the 1970s, D. Rodding and H. Schwichten- 
berg of the Miinster school [113] gave a sufficient but not necessary 
condition: any set of integers recognizable by a deterministic lin- 
ear space-bounded Turing machine is a first-order spectrum. (This 
is also a consequence of results of Bennett and Ritchie [9, 110], ob- 
tained before the emergence of complexity theory.) Further, Rodding 
and Schwichtenberg showed that sets of integers recognisable using 

It seems that some of Bennet's unpublished results were rediscovered independently 
in China in the late 1980ties by Shaokui Mo [95]. We shall discuss his work in Section 
1X1 
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larger space bounds are higher order spectra. C. Christen developed 
this line further [18, 19], independently obtaining a number of the 
following results. 

At the same time the spectrum problem gained renewed interest in 
the USA. In 1972 A. Selman and N. Jones found an exact solution to 
Scholz's original question [79]: a set of integers is a first order spec- 
trum if and only if it is recognizable by a non-deterministic Turing 
machine in time 0(2 c ' n ). 

This result was independently also obtained by R. Fagin in his 
thesis [40], which contains an abundance of further results. Most 
importantly, R. Fagin studies generalized spectra, which are the pro- 
jective classes of Tarski, restricted to finite structures, and really laid 
the foundations for Finite Model Theory and Descriptive Complex- 
ity, as can be seen in the monographs [76, 36, 84]. We shall discuss 
the complexity theoretic approach in detail in Section 

Images and preimages of spectra: From Proposition 12.41 it follows 
that, if S is a first order spectrum and p is a polynomial with positive 
coefficents, then p(S) = {p(m) : m £ S} is also a spectrum. In J. 
Bennett's thesis it is essentially proved that there is a first order 
spectrum S and an integer k such that {n : 2 n 6 S} is not a 
spectrum. It is natural to ask what happens to a spectrum under 
images and preimages of number theoretic functions. The general 
line of this type of results states that certain images or preimages of 
spectra of specific forms of sentences are or are not spectra of other 
specific forms of sentences. 

Spectra of syntactically restricted sentences: Already in a paper 
by L. Lowenheim from 1915 [87] it is noted that, what later will 
be called the spectrum of a sentence in monadic second order logic 
(MSOL) with unary relation symbols only, is finite or cofinite. The 
set of even numbers is the spectrum of an MSOL sentence with one 
binary relation symbol, and it is ultimately periodic. Further, every 
ultimately periodic set of positive integers is a spectrum of a first 
order MSOL sentence with one unary function symbol. Over the 
last fifty years various papers were written relating restrictions on 
the use of relation and function symbols, or other syntactic restric- 
tions, to special forms of spectra. R. Fagin, in his thesis, poses the 
following problem 



Open Question 4 (Fagin's Problem for binary relations). Is ev- 
ery first order spectrum the spectrum of a first order sentence of one 
binary relation symbol? 
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The question is even open, if restricted to any fixed vocabulary that 
contains at least one binary relation symbol or two unary function 
symbols. 

Much of this line of research is motivated by attempts to solve 
Fagin's problem. 

Transfer theorems: Another way of studying spectra is given by the 
following result, again from Fagin's thesis: If S is a spectrum of a 
purely relational sentence where all the predicate symbols have arity 
bounded by k, then S k = {m k : m 6 S} is a spectrum of a sentence 
with one binary relation symbol only, or even a spectrum on sim- 
ple graphs. One can view this an approach combining the study of 
images and preimages of spectra with either syntactically or seman- 
tically restricted spectra. Over the years quite a few results along 
this line were published. We shall discuss the last three approaches 
under the common theme of restrictions on vocabularies in detail in 
Section [H 

Spectra of semantically restricted classes: R. Fagin shows that 
Asser's problem has a positive answer if and only if it has a pos- 
itive answer if restricted to the class of simple graphs. Similarly, in 
order to understand Fagin's problem better, one could consider re- 
stricted graph classes K, and study first order spectra restricted to 
graphs in K. One may think of graphs of bounded degree, planar 
graphs, trees, graphs of tree-width at most k, etc. 

Open Question 5 (Fagin's Problem for simple graphs). Is every 
first order spectrum the spectrum of a first order sentence over simple 
graphs ? 

Open Question 6. Is every first order spectrum the spectrum of 
a first order sentence over planar graphs? 

For restrictions to graph classes of bounded tree-width, the an- 
swer is negative. The reason for this is that spectra of graphs of 
bounded tree-width are ultimately periodic. In fact, this holds for 
a much wider class of spectra. E. Fischer and J. A. Makowsky, [49], 
have analyzed under what conditions MSOL-spectra are ultimately 
periodic. We shall discuss their results in detail in Section El 

This line of thought has not been extensively explored, this may 
well be a fruitful avenue for studying spectra in the future. 

In the sequel of this survey we shall summarize what is known about 
spectra along these themes. Various solutions to Scholz's Problem were 
offered in the literature, varying with the tastes of the times, but there 
may be still more to come. Asser's and Fagin's Problems are still open. 
Both problems are intimately related to our understanding of definability 
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hierarchies in Descriptive Complexity Theory. They may well serve as 
benchmarks of our understanding. 

§3. Understanding Spectra: counting functions and number 
theory. In this section we formulate various ways to test our under- 
standing of spectra. It will turn out that there still many questions we 
do not know how to answer. 

3.1. Representation of spectra and counting functions. Spectra 
are sets of positive natural numbers. These sets can be represented in 
various ways. We shall use the following: 

Definition 3.1. Let M C N + , and let mi, 1712, ... an enumeration of 
M ordered by the size of its elements. 

(i) XM{n) is the characteristic function of M, i.e., 



(hi) 7m (n) is the counting function of M, i.e., 7m (n) is the number of 
elements in M that are strictly smaller than n. 

(iv) A gap of M is a pair of integers g\ , g<i such that g\ , g<i 6 M but for 
each n with g\ < n < g^ we have that n M. Now let <5m(^) be the 
length of the nth gap of M. Clearly, 5m (n) = t?m(« + 1) — VM^n). 

Obvious questions are of the following type: 

Open Question 7. Which strictly increasing sequences of positive in- 
tegers, are enumerating functions of spectra? For instance, how fast can 
they grow? 

Open Question 8. If M is a spectrum how can (5m(«) behave? 

Coding runs of Turing machines one can easily obtain the following. 

Proposition 3.2. For every recursive monotonically increasing func- 
tion f there is a first order formula 4> such that 5<p(n) = f(n). 

Various other partial answers to these questions will appear throughout 
our narrative. 




1 if n G M 
else . 



(ii) rjM{n) is the enumeration function of M, i.e., 




if it exists 
else . 
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3.2. Prime numbers. An obvious question is whether the primes 
form a spectrum. If one gets more ambitious one can ask for special 
sets of primes such as Fermat primes (of the form 2 2 +1), Mersenne 
primes (of the form 2 P — 1 with p a prime), or the set of primes p such 
that p + 2 is also a prime (twin primes). Even if we do not know whether 
such a set is finite, which is the case for twin primes, it may still be possi- 
ble to prove that it is a spectrum. The answer to all these question is yes, 
because all these sets are easily proved to be rudimentary, see Section [H 

In the sense of the above definitions we have Xprimes is the character- 
sitic function of the set of primes, r] pr i mes (n) = p n , and 7 pr imes ("-) is the 
counting function of the primes, usually denoted by Tv(n). 5 pr i mes (n) is 
usually denoted by d n . All these functions related to primes are subject 
to intensive study in the literature, see eg. [108]. As we have said that 
the primes form a first order spectrum, all the features of these functions 
observed on primes do occur on spectra. 

For instance, ir(n) is approximated by the integral logarithm li(n), and 
it was shown by J.E. Littlewood in 1914, cf. [108] that %{n)—li{n) changes 
sign infinitely many often. For logical aspects of Littlewood's theorem, 
see [82]. 

Let us define 

7T+ = {n : n(n) - li(n) > 0} 
7r~ = {n : 7r(n) — li(n) > 0} 
A less obvious question concerning spectra and primes is 
Open Question 9. Are the sets ir + and ir~ spectra? 

3.3. Density functions. Many combinatorial functions are defined 
by linear or polynomial recurrence relations. Among them we have the 
powers of 2, factorials, the Fibonacci numbers, Bernoulli numbers, Lucas 
numbers, Stirling numbers and many more, cf. [57]. 

Question 3.3. Are the sets of values of these combinatorial functions 
first order spectra? 

The answer will be yes in all of these cases. We shall sketch a proof in 
Section [A.3I that is based on the existence of such recurrence relations. 

But these functions also allow combinatorial interpretations as count- 
ing functions: The powers of 2 count subsets, the factorials count linear 
orderings, the Stirling numbers are related to counting equivalence rela- 
tions. We shall see below that in these three examples the combinatorial 
definitions allow us to give alternative proofs that these sets of numbers 
are first order spectra. 

The spectrum of a sentence 4> witnesses the existence of models of 4> of 
corresponding cardinalities. Instead, one could also ask for the number 
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of ways the set {0, 1, . . . , n — 1} = [n] can be made into a model of <p. 
Alternatively one could count models up to isomorphisms or up to some 
other equivalence relation. 

Combinatorial counting functions come in different flavours; 

Definition 3.4. Let C be a class of finite r-structures. With C we 
associate the following counting functions: 

(i) fc(n) is the number of ways one can interpret the relation symbols 
of r on the universe [n] such that the resulting structure is in C. This 
corresponds to counting labeled structures. 

(ii) Let Str(T)(n) denote the number of labeled r-structures of size n. 
We put 

, / \ fc{n) 

probc(n) = SMtTR 

which can be interpreted as the probability that a labeled r-structure 
of size n is in C. 

(iii) f(f°(n) is the number of non-isomorphic models in C of size n. 

(iv) For an equivalence relation E on C we denote by (n) the number 
of non-E-equivalent models in C of size n. 

(v) If E is the /c-equivalence from Ehrenfeucht-Frai'sse games, f^{n) is 
denoted by Nq k(n), and is called an Ash- function, cf. [6] and Section 

m 

(vi) If C consists of all the finite models of a sentence (f> we write ff{n) 



instead of f^(n). Similarly for 



n) 



Counting labeled and non-labeled structures has a rich literature, cf. 
[68, 130]. Note that counting non-labeled non-isomorphic structures is in 
general much harder than the labeled case. The first connection between 
counting labeled structures and logic is the celebrated 0-1 Law for first 
order logic: 

Theorem 3.5 (0-1 Laws). For every first order sentence (p over a purely 
relational vocabulary r we have: 

(i) (Y. Glebskii, D. Kogan, M. Liogonki and V. Talanov [53]; R. Fagin 

[40]) 

lim probJn) = < 

n^oo v ] 

and the limit always exists. 

(ii) (E. Grandjean [58]) Furthermore, the set of sentences 4> such that 
limn^oo prob^{n) = 1 is decidable, and in fact PSpace-complete. 

What we are interested in here, is the relationship of such counting 
functions to spectra. Our example of powers of 2 shows that 

2 n = U(n)= V ^n) 
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where eft is an always-true first order sentence with one unary relation 
symbol, and ift is the conjunction of the axioms of Boolean algebras. Sim- 
ilarly, 



where 4>lin are the axioms of linear orders, and ip describes the following 
situation: 

(i) P is a unary relation and R is an linear order on P. 

(ii) E is a ternary relation that is a bijection between the universe (first 
argument x) and all the linear orderings on P (remaining two argu- 
ments y, z). 

(iii) First we say that there is an x that corresponds to R; and that for 
x 7^ x' the orderings are different. This says that E is injective. 
To ensure that we get all the orderings on P we say that for every 
ordering and every transposition of two elements in this ordering, 
there is a corresponding ordering. 

Hence, the size of the model of ip is the number of linear orderings on P. 

Clearly, if is not strictly increasing, there is no ip with /^(n) = rj^{n). 
For instance, for <p> which says that some function is a bijection of a part 
of the universe to its complement, we have 



Open Question 10. Let <p a first order sentence, and f^ be the asso- 
ciated labeled counting function that is monotonically increasing. Is there 
a first order sentence ip such that for all n 



The converse question seems more complicated. For instance, as we 
have noted before, the primes p n are of the form r/^, for some first order 
ip, but we are not aware of any labeled counting function that will produce 
the primes. 

R. Fagin [41] calls (p categorical if f^°(n) < 1 for every n. For instance, 
4>lin is categorical. The counting function up to isomorphisms can be 
bounded by any finite number m, using disjunctions of different categor- 
ical sentences. So it may be less promising to study for which first order 
sentences <p there is a ip such that fl so (n) = %(n), or vice versa. 

Surprisingly enough, C. Ash [6] has found a connection between Asser's 
Problem and the behaviour of the Ash functions defined in Definition 
I3,4tjvj) . We shall discuss this in Section [71 



nl = /<?w( n ) = ^( n ) 




Ufa) = Vi>{n) 
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3.4. Sentences with prescribed spectra. In the light of Theo- 
rem !3.5l we note that if probAn) tends to 1 then spec(4>) is cofinite. Obvi- 
ously, the converse does not hold, because there are categorical sentences 
with models in all finite cardinalities. 

Trakhtenbrot's Theorem says that it is undecidable whether a spectrum 
is empty, and Grandjean's Theorem says that it is decidable, whether a 
sentence is almost always true, i.e. probAn) tends to 1. As a partial 
answer to Questions 12.31 and 12.51 we have: 

Proposition 3.6. Let eft be a first order sentence. The following are 
undecidable: 

(i) spec(c/>) is finite, cofinite. 

(ii) SPEC(0) is ultimately periodic. 

(iii) spec((/>) is, for given a, b 6 N of the form a + 6N. 

(iv) SPEC(0) = S for a given set S C N + . 

Sketch of Proof. Let (p e FO. We describe the construction of 
FO-sentences ipi, tp2, ip3, ipi and ^5 such that SPEC((/?) = if and only if: 

- SPEc(Vr) is finite. 

- SPEC(V^2) = N+. 

- More generally, SPEC(V>3) = {f(i) \ i S N + } for a given function / 
such that f(i) > i for all i and the graph n = f{i) seen as a binary 
relation is rudimentary (see Section f4.2l for a precise definition). 

- SPEC(V>4) is cofinite. 

- specks) is ultimately periodic. 

Since the problem of emptiness of spectra is undecidable, the announced 
result follows. 

Let and max be two constant symbols, let < be a binary predi- 
cate symbol and let + and x be two ternary predicate symbols. Let 
Arithm(0, max, <, +, x) denote a first-order sentence axiomatizing the 
usual arithmetic predicates. Our sentences ipi (i = 1,... ,3) consist of 
the conjunction of Arithm(0, max, <,+, x) with a specific part ipl that 
we will describe below. We shall use the fact that the Bit predicate is 
definable from + and x in finite structures, as well as the ternary rela- 
tion a = b c . For simplicity, we will assume w.l.o.g. that the signature 
of (p consists of a binary relation R only. Let n and y be new variable 
symbols. Let (p'(n,y) be the formula obtained from ip by replacing every 
quantification \/x by Vx < n and 3x by 3x < n, and every atomic formula 
R(x,x') by Bit(y,x + nx'). The idea is that a graph R on a set of n 
elements seen as {0, ... , n — 1} is encoded by the number y < 2" written 
in binary with a 1 in position a + bn if and only if R(a, b) holds. Hence 



2 Bit(a, b) is true iff the bit of rank 6 of a is 1. 
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for all n G N + , we have 3y < 2™ <p'(n, y) if and only if ip has a model with 
n elements. 

- Let ip[ = 3m, n,y < max(max = 3 m x 2™ 2 A y < 2™ 2 A <p'(n, y)). 

It is easy to verify that if SPEC(^) = 0, then spec(Y>i) is also 
empty (hence finite), and conversely, if SPEC(y) ^ 0, then spec(V'i) 
contains all the integers of the form 3 m x 2™ for some m G N + and 
n G SPEC((^), i.e. spec(V>i) is infinite. 

- Let ip' 2 = (V n < max max / 2 n ) V (3n < max(max = 2 n A Vy < 
2« 2 V(™,2/)))- 

If spec((^) = 0, then for all n and y, the condition 3y < 2 n ip'(n, y) 
is false, hence SPEC(V>2) = N + . Conversely, if spec(<^) / 0, then the 
integers of the form 2™ with n G SPEC(<^) are not in spec(t/ ; 2)- 

- Since the binary relation y = f(x) is rudimentary, it is definable 
from + and x in finite structures. Let ifj' 3 = 3i < max(max = 
/(«) A ((Vn < i i ^ 2™ 2 ) V (3n < i(i = 2 n " A Vy < 2 n " VK </)))))• 

The verification that spec((^) = if and only if spec(t/>3) = 
{/(i) | i G N + } is similar to the previous case. 

- Let %p' A = Vn < max(2 n2 < max — ► Vy < 2™ 2 -iy?'(n, y)). 

If SPEc(y) = 0, then for all n and y, the condition 3y < 2 n p'(n, y) 
is false, hence SPEC(^4) = N + (hence is cofinite). Conversely, if 
SPEC((^) / 0, then the integers greater than 2™ with n G SPEC(</?) 
are not in spec(V'2)) which is not cofinite. 

- Let ip' b = 3n, m < max(max = 2™ x m 2 f\3y < 2 n tp'(n, y) A Vn' < 
nVy' < 2 n2 ^<p'(n',y'))). 

If SPEC((/9) = 0, then spec^) = (hence is ultimately periodic). 
Conversely, observe that specks) = {no x m2 I m £ f^+} 5 where 
no = inf (spec (</?)). Hence specks) is not ultimately periodic. 

H 

3.5. Real numbers and spectra. Let x<j>{ n ) be the characteristic 
function of the spectrum of a first order sentence <p. We can associate 
with 4> and a£Z the real number = a + ^ n x§(n)2r n . 

Definition 3.7. A real number is first order spectral if it is of the form 
for some a£Z and some first order sentence <p. 

As we have noted ultimately periodic sets of natural numbers are first 
order spectra, and correspond to rational numbers. Also every ultimately 
periodic spectrum can be realized by a formula with one function symbol 
only. We have 

Proposition 3.8. Every rational number q is first order spectral using 
a formula with one function symbol only. 
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Question 3.9. Do the the first order spectral reals form a field? 

E. Specker, [122], proved that there is a real x primitive recursive in base 
2 such that 3x, x + |, x 2 are not primitive recursive in base 2. A modern 
treatment can be found in [16]. H. Friedman [51] mentioned on an inter- 
net discussion site that primitive recursive can be replaced in Specker's 
Theorem by much lower complexity within the Grzegotczyk Hierarchy. J. 
Miller kindly provided us, [93], with the more precise statement 

Theorem 3.10 (E. Specker, 1949 and H. Friedman 2003). 

There is a real x which is primitive recursive in base 2 (and can even be 
taken to be in £ 2 of the Grzegorczyk Hierarchy), such that 3x, x + ^, x 2 
are not primitive recursive in base 2. 

Sketch of proofH. This can be proved by exploiting the fact that 
none of these functions 3x,x 2 ,x + 1/3 are continuous as functions on 
binary expansions of reals. They can take a number x that is not a 
binary rational to one that is. 

Let us focus on 3x. So, for example, if x = 0.0101010101 . . . , then 
3x = 1. We can exploit this as follows. Say we have built the binary 
expansion of x up to position n — 1 and it looks like 0.6 (where b is a 
finite string) and that we want to diagonalize against the zth primitive 
recursive function pi. Compute Pi(n), step by step. As long as it does 
not converge, keep building x to look like 0.60010101010101 .... If Pi(n) 
converges at stage s, then use position n + 2s or position n + 2s + 1 to 
spring the delayed trap. If pi(n) = 0, then let x = 0.60010101 . . . 01011. 
If Pi (n) = 1, then let x = 0.60010101 ... 0100. Either way, pi(n) does 
not correctly compute the nth bit of 3x. Note that we spread out the 
unbounded search, so that each bit is computed by a bounded (primitive 
recursive) procedure. 

In this way we can diagonalize against 3x, x 2 and x+1/3 being primitive 
recursive while making x primitive recursive. To make x to be in £ 2 one 
uses the fact that £ 2 is the same as computable in linear space [110]. 

In the argument above, when we are trying to figure out bit t of x, we 
compute Pi(n) (for some i and n determined earlier in the construction of 
x) for t steps and if it does not halt we output the default bit (alternating 
between and 1), so it can be made in linear time. Actually, in the case 
of x a 2, one has to work a little harder to determine the default bit, but 
this can definitely be done in linear space (and polynomial time). H 

We shall see in Section HI Theorem 14.111 that Theorem 13.101 covers all 
the spectral reals, therefore the spectral reals do not form a field. More 
precisely we have the following Corollary: 

Corollary 3.11. The spectral reals are not closed under addition nor 
under multiplication. Furthermore, they are closed under the operation 
1 — x iff the complement of a spectrum is a spectrum. 
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We now turn the question of algebraicity and transcendence of spectral 
reals. Clearly, every first order spectral real is a recursive real in the 
sense of A. Turing [127]. Using Liouville's Theorem^, we can see that 
many transcendental reals are first order spectral. 

Open Question 11. Are there any irrational algebraic reals which are 
spectral? 

One way of analyzing irrational numbers is by counting the number 
of Is in their binary representation. For a real r £ (0, 1) let J r (n) be 
the number of Is among its first n digits. If r = is spectral we have 
7r (n) = jp h i{n). 

In the sequel we follow closely and quote from M. Waldschmidt [129]. 

Theorem 3.12 (Bailey, Borwein, Crandall, and Pomerance, 2004, [8]). 
Let r be a real algebraic number of degree d > 2. Then there is a positive 
number C r ^, which depends only on r, such that r y r ( n ) > Cr,dn>d . 

In other words, if a spectral number r^ is algebraic of degree d > 2, 
then 7^ (n) > C^rid, for some positive number C^. 

To get more information about irrational numbers r we have to look at 
the binary string complexity of r £ (0,1). We consider r as an infinite 
binary word. 

Definition 3.13 (Binary string complexity). The binary string com- 
plexity of r is the function p r (m) which counts, for each m the number 
of distinct binary words w of length m occuring in r. Hence we have 
1 < p r (m) < 2 m , and the function p r (m) is non-decreasing. 

Conjecture 12 (E. Borel 1950, [12]). The binary string complexity of 
an irrational algebraic number r should be p r {fn) = 2 m . 

Definition 3.14. We call a real number r £ (0,1) automatic if the 
n-th bit of its binary expansion can be generated by a finite automaton 
from the binary representation of n. 

Clearly, the binary string complexity of an automatic real is 0(m). 

Open Question 13. Is every automatic real a spectral real? 

In 1968 A. Cobham, [20] conjectured that automatic numbers are tran- 
scendental. This was proven in 2007 by B. Adamczewski and Y. Bugeaud, 
[1]. They actually proved a stronger theorem. 

Theorem 3.15 (B. Adamczewski and Y. Bugeaud, 2007). The binary 
string complexity p r (m) of a real irrational algebraic number r satisfies 

lim inf ^ r ^ m ^ = _)_cx) 
m— >oo rn 

4 Liouville's Theorem states, in simplified form, that a real of the form r = 2~- f< -"' ) 
where /(n) > n! is transcendental. 
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Borel's Conjecture would imply that the binary string complexity p r (m) 
of a real irrational algebraic number r satisfies 

lim inf — - = 1 

m— >oo 2 m 

Open Question 14. Does the binary string complexity p r (m) of a spec- 
tral real r satisfy 

hminf^M<l 

or even 

liminf^M = 0? 

From Theorem 13.151 one gets that the Fibonacci numbers, which will 
be shown to form a spectrum in Corollary 14.121 of Section 14.31 give us a 
transcendental spectral number. More generally, we get the following: 

Proposition 3.16. Let be a spectral real such that the gap function 
5^(n) is monotonically increasing and grows exponentially. Then p^n) = 
0(n). Therefore, r^ is transcendental. 

The analysis of computable reals in binary or 6-adic presentation is 
tricky because of the behaviour of the carry, cf. [17]. Let J 7 be a class of 
functions / : N — ► N. 

Definition 3.17. 

A real number a is called J- '-Cauchy computable if there are functions 
f,g,h £ T such that for 

_ f(n) ~ g(n) 
Tn h(n) + 1 

we have that for all n £ N 

1 

\ r n — a < . 

1 '- n + 1 

A real number a £ [0, 1] in 6-adic presentation is called ^-computable if 
there is / : N -> {0, . . . ,6-1} such that 



a = 

nGN 

Note that it is not clear at all how to define spectral Cauchy reals. If T 
contains the function 2™ then the T computable reals in 6-adic presenta- 
tion are also ^-Cauchy computable. In particular, this is true for T = E % 
and i > 3. 

Open Question 15. Are the b-adic £ 2 -computable reals £ 2 -Cauchy com- 
putable? 
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Recently, £ 2 -Cauchy computable reals have received quite a bit of at- 
tention, cf. [119, 120]. The following summarizes what is known. 

Proposition 3.18 (D. Skordev). (i) The £ 2 -Cauchy computable re- 
als form a real closed field. 

(ii) The transcendental numbers e and ir, and the Euler constant 7 and 
the Liouville number ^ ng ^ 10 a —n\ are £ 2 -Cauchy computable. 

(iii) There are £ 3 -Cauchy computable reals which are not £ 2 -Cauchy com- 
putable. 

Let Tiow be the smallest class of functions in £ 2 which contains the 
constant functions, projections, successor, modified difference, and which 
is closed under composition and bounded summation. A real a is low if 
a is J-[ ow -Cauch.y computable. The low reals also form a real closed field. 
In [125] low reals are studied and some very deep theorems about low 
transcendental numbers are obtained, the discussion of which would take 
too much space. 

Open Question 16. Is the inclusion Ti ow C £ 2 proper? 
§4. Approach I: Recursion Theory. 

This approach has generated all in all four papers (namely [7] by G. 
Asser in 1955, [98] by A. Mostowski in 1956, [110] by R. Ritchie, and [95] 
by S. Mo in 1991) and two Ph.D. dissertations, namely [109] by R. Ritchie 
in 1960 and [9] by J. Bennett in 1962. These works share the common 
feature of being hardly available for many readers on various grounds: 
Asser's and Mostowski's papers are difficult to read because they are more 
than fifty years old and Asser's paper is in German. Bennett's thesis, 
cited in many papers, is almost equally old and in addition has remained 
unpublished. Finally, Mo's paper, though more recent, is in Chinese. 
This is the reason why we propose in Section [A] a detailed review of these 
references, including several sketches of proofs in modern language. In the 
present section, after some background material, we present a synthetic 
survey of the recursive approach of the spectrum problem. 

4.1. Grzegorczyk's Hierarchy. For a detailed presentation of the 
material in this subsection, see eg. [114]. A. Grzegorczyk's seminal paper 
[65] about classification of primitive recursive functions was published 
in 1953, one year after Scholz's question, and two years before Asser's 
paper. Hence, Grzegorczyk's Hierarchy was not the standard way to 
consider primitive recursive functions in the mid-fifties. And actually, G. 
Asser and A. Mostowski deal with recursive aspects of spectra, but not 
explicitly with Grzegorczyk's classes, though it is the usual framework in 
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schema: 



which their results are presented. It is only in J. Bennett's thesis in 1962 
and especially in S. Mo's paper in 1991 that one finds an explicit study 
of spectra in terms of Grzegorczyk's classes. 

In the sequel a function is always intended to be a function from some 
N fc to N (total, unless otherwise specified). 

Definition 4.1 (Elementary functions) . The class £ of elementary func- 
tions is the smallest class of functions containing the zero, successor, pro- 
jections, addition, multiplication and modified subtraction functions and 
which is closed under composition and bounded sum and product (i.e. 
f(n,x) = YH=o9(h%) an d f{ n -,x) = nr=o w ith previously defined 
g). We denote by the elementary relations, i.e. the class of relations 
whose characteristic functions are elementary. 

The class £ was introduced by Kalmar [81] and Csillag [28] in the forties, 
and contains most usual number-theoretic functions. It also corresponds 
to Grzegorczyk's class £ 3 , that we define below. 

Definition 4.2 (Primitive recursion). Let f,g,h be functions. We say 
that / is defined from g and h by primitive recursion when it obeys a 
f(0,x) = g(x) ' 
f(n+l,x) = h(n,x,f(n,x)) 
The class of primitive recursive functions, denoted by VR, is the small- 
est class of functions containing the zero function, the successor function, 
the projection functions, and which is closed under composition and prim- 
itive recursion. 

For instance, elementary functions are primitive recursive. The follow- 
ing binary function Ack, known as Ackermann's function, is provably not 
primitive recursive, whereas all unary specialised functions Ack x : y t— ► 
Ack(x, y) are primitive recursive: 
Ack(Q,y) =y + l 
Ack(x + 1,0) = Ack(x,l) 
Ack(x + 1, y + 1) = Ack(x, Ack(x + 1, y)) 

In order to introduce Grzegorczyk's hierarchy, we need a weaker version 
of primitive recursion, in which the newly defined functions have to be 
bounded by some previously defined function. 

Definition 4.3 (Bounded recursion). Let f,g,h,j be functions. We 
say that / is defined from g, h and j by bounded recursion when it obeys 

' f(0,x) = g{p!) 

a schema: < f(n+l,x) = h(n,x, f(n,x)) 

, f(n,x) < j(n,x) 

Let f n (n = 0, 1, 2, . . . ) be the following sequence of primitive recursive 
functions : 
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- k{x,y) = y + 1, 

- fi{x,y) =x + y, 

- f 2 (x,y) = {x + l)-(y+l), 




= f k+2 (y + l,y+l) 
= fk+3(,x,f k+3 (x,y)) 



Roughly speaking, the important feature is that the functions f n are 
more and more rapidly growing. Several other similar sequences of in- 
creasingly growing functions can be used to define Grzegorczyk's classes. 

Definition 4.4 (Grzegorczyk's hierarchy). The Grzegorczyk's class £ n 
is the smallest class of functions containing the zero function, the projec- 
tions functions and f n and which is closed under composition and bounded 
recursion. The associated classes of relations £" are defined as the class 
of relations on integers with a characteristic function in £ n . 

Note that, for sake of simplicity, we use the same notation for a class of 
relations of various arities (eg. £+) and the class of unary relations (i.e. 
sets) it contains. Which one is intended will always be clear from the 
context. 

The main features of Grzegorczyk's classes were studied by A. Grzegor- 
czyk in [65] and by R. Ritchie in [110]. 

Theorem 4.5 (A. Grzegorczyk (1953)). 

- The functional hierarchy is strict for n > 0, i.e. we have £ n C £ n+l . 

- The relational hierarchy is strict for n > 3, i.e. we have £™ C 

- For the initial levels of the relational hierarchy, we have ^° C ^ C 



- The Kalmdr-Csillag class of elementary functions £ is equal to £ . 

- Finally, the full hierarchy corresponds to primitive recursion, i.e. 



Theorem 4.6 (R. Ritchie (1963)). We have S* / £l [110]. 

Note that the possible separation of the relational classes is 
still an open question. 

Open Question 17. Are the inclusions in 



An important point is that the functional hierarchy deals with the rate 
at which functions may grow: intuitively, functions in the low level of 
the hierarchy grow very slowly, while functions higher up in the hierarchy 
grow very rapidly. However, this feature does not hold for the relational 
hierarchy, because characteristic functions do not grow at all (they are 
— 1 valued). For instance, the ternary relations z = x + y, z = xxy, z = 




VR = {] 



l+oo cn 
] n=0 c 



e° c si c £l 



proper? 
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x y as well as z = Ack{x,y) all belong to £®, whereas the corresponding 
functions provably do not lie in £°. 

4.2. Rudimentary relations and strictly rudimentary relations. 

In addition to primitive recursive classes of relations, we also introduce 
two new classes of relations with an arithmetical flavour, namely the rudi- 
mentary and strictly rudimentary relations. These classes were originally 
introduced by R. Smullyan [121], and a major reference about rudimen- 
tary relations and subclasses is J. Bennett's thesis [9]. 

Definition 4.7 (Rudimentary relations). Denote by Rud the smallest 
class of relations over integers containing the graphs of addition and mul- 
tiplication (seen as ternary relations) and closed under Boolean operations 
(-i, A, V) and bounded quantifications (\/x < y . . . and 3x < y . . . ). 

In spite of its very restricted definition, the class Rud is surprisingly 
robust (eg. it has several equivalent definitions in the fields of computa- 
tional complexity, recursion theory, formal languages etc.) and large. For 
instance, the following formula defines the set of prime numbers: 

x > 1 A \/y < x y z < x ^{ x = y.z ) 

More (sometimes VERY) sophisticated formulas prove that the ternary 
relation z = x y is rudimentary (Bennett [9]), as well as the graph z = 
Ack(x,y) of Ackermann's function (Calude [13]), which is not primitive 
recursive (as a function), or the four-ary relation x y = z [mod t] (Hesse, 
Allender, Barrington [71]). Actually, we are not aware of a natural number 
theoretic relation which is provably not rudimentary. 
The following is easy to see. 

Proposition 4.8. Rud c£°c £\ c S%. 

However, the equality remains an open question (and would imply 
Rud = as well, since the closure of £® by polynomial substitution 
is £l whereas Rud is closed under polynomial substitution). 

Open Question 18. Are the inclusions in Rud C £° C £\ C 
proper? 

It remains to introduce the strictly rudimentary relations. Let us con- 
sider the dyadic representation of integers, i.e. n G N is represented by 
a word in {1,2}*. Compared to binary notation, dyadic notation avoids 
the problem of leading Os and yields a bijection between integers and 
words. When integers are seen as words, it is natural to consider subword 
quantifications instead of ordinary bounded quantification. We say that 
w = w± . . . Wk is a subword of v = v\ . . . v p and we denote w \ v when there 
exists 1 < i < p such that w± = Vi, . . . , = Vi + k-i- Of course, if x \y, 
then x < y. 
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Definition 4.9 (Strictly rudimentary relations). Denote by Srud the 
smallest class of relations over integers containing the graphs of dyadic 
concatenation (seen as a ternary relation) and closed under Boolean op- 
erations (-1, A, V) and subword quantifications (Vx \y . . . and 3x\y . . .). 

There are only few examples of strictly rudimentary relations, e.g. x 
begins (or ends or is a part of) y (as dyadic words), x = y, the dyadic 
representation of x is a single symbol, the dyadic representation of x 
contains only one type of symbol. On the other hand, several relations 
are provably not strictly rudimentary such as x < y, x = y + 1, x and y 
have the same dyadic length and the dyadic representation of x is of the 
form l n 2 n for some n (V. Nepomnjascii 1978, see [101]). 

Note that rudimentary relations were originally (and equivalently) de- 
fined by Smullyan [121] using dyadic concatenation as a basis relation 
instead of addition and multiplication. Clearly, we have Srud C Rud. 

4.3. Recursive and arithmetical characterizations of spectra. 

In the fifties and sixties, following the tastes of their time, logicians aim at 
characterizing Spec via recursion and arithmetics. Typically, they wished 
to obtain the characteristic functions of spectra as the 0-1-valued func- 
tions in a class defined by closure of a certain set of simple functions un- 
der certain operators (such as composition or various recursion schemas). 
From this point of view, their results are not totally satisfactory because 
they are either partial, or somehow cumbersome or unnatural. However, 
these studies show that the class of spectra is very broad, and that most 
classical arithmetical sets are spectra. 

The class Spec is set within Grzegorczyk's hierarchy (by G. Asser in [7] 
and A. Mostowski in [98]), from which we can deduce that all rudimentary 
sets are spectra. 

Theorem 4.10 (G. Asser (1955)). Spec C 

Asser's theorem is based on a rather complicated and artificial arith- 
metical characterization of spectra (see Subsection lA.lj) . In particular, 
Asser's construction is of no help in proving that a particular set is (or 
not) spectrum. 

Though he actually uses a slightly different class (see Subsection IA.2[) , 
the following result is usually attributed to Mostowski: 

Theorem 4.11 (A. Mostowski (1956)). S% C Spec 

Note that equality in Mostowski's theorem remains an open question. 

Open Question 19. Is the inclusion in El C Spec proper? 

The following corollary is not stated by A. Mostowski, but can be found 
in J. Bennett's thesis. It is worth noting because one of the most fruitful 
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ways in proving that various arithmetical sets are spectra is to prove that 
they are actually rudimentary. 

Corollary 4.12. Rud c Spec 

For instance, any set defined by a linear or polynomial recurrence condi- 
tion, such as the Fibonacci numbers (i.e. those numbers appearing in the 
sequence defined by Uq = u\ = 1 and u n+ 2 = u n + u n+ \), is rudimentary 
(see [39]). From Corollary 14.12^ we deduce that such sets are spectra, as 
announced in Section [3l Similarly, using the fact that the set of prime 
numbers is rudimentary and the exponentiation has a rudimentary graph, 
one proves that the sets of Fermat primes (of the form 2 2 " + 1), Mersenne 
primes (of the form TP — 1 with p a prime), or twin primes (p prime such 
that p + 2 is also a prime) are rudimentary (hence also spectra). 

Note that the question of whether the inclusion in Corollary 14.121 is 
proper is still open. 

Open Question 20. Do we have Rud = Spec? 

This problem is further investigated in Subsubsection [6.3,31 

An arithmetic characterization of Spec in terms of strictly rudimentary 
relations is also given, among many other results (see Subsection IA.3p . by 
J. Bennett in his thesis. 

Theorem 4.13 (J. Bennett (1962)). 
A set S C N is in Spec iff it can be defined by a formula of the form 
3y <2 x3 R(x,y) for some j > 1, where R is in Srud. i.e., 

S = {x e N | 3y<2 xJ R(x,y)} 

for some R 6 Srud and j ' > 1. 

J. Bennett also characterizes spectra of higher order logics and shows 
that the union of spectra of various orders equals the class of elementary 
relations 

The characterization of spectra stated in Theorem 14. 131 is rather simple 
and elegant. However, once again, it is not really useful in proving that a 
given set is a spectrum, now because Srud is very restrictive. A somehow 
similar characterization of Spec using Rud instead of Srud would have 
been more powerful - but, one gets this way second-order spectra. 

Finally, let us note a late paper on the recursive aspect of spectra, 
namely [95], due to the Chinese logician Mo Shaokui in 1991 (see Subsec- 
tion [A]4]). The solution to Scholz's problem proposed there is of the same 
type as Bennett's characterization. However, the only bibliographic refer- 
ences in Mo's paper are Scholz's question [116] and Grzegorczyk's paper 
[65], so that it can be considered completely independent from all other 
contributions about spectra. Section lAl summarises this paper's results. 
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§5. Approach II: Complexity Theory. The spectrum problem, for- 
mulated in the early 1950s, predates complexity theory since the notions 
of time or space bounded Turing machines first emerged in the 1960s (see 
[70, 83]). However, the first results about complexity of spectra appeared 
very soon (see Subsection 15. 3p . and computational complexity characteri- 
sations of spectra were found, in at least three independent early contexts 
(see Subsection 15. 4p . Later on, several refinements and developments of 
these seminal results have been published (see Subsections 15.71 and 15 . 8[> . 

Turing machines and other standard models of computation operate on 
words, not on numbers. Let L C X* be a set of finite words over a fixed 
finite alphabet S. Without loss of generality we assume X = {0, 1}, and 
that input words have no leading zeros. 

The archetypical task, given a language L, is to study the complexity 
of deciding membership in L of a word function of the length \x\, 

i.e., asymptotic growth rate of the time, space or other computational 
resources needed to decide whether x G L. 

5.1. Complexity and spectra. In this section, for a fixed sentence 4>, 
the set of natural numbers SPEC((/>) is seen as the set of positive instances 
of a decision problem (given a number n, is there a model of (p with n 
elements?). 

When dealing with computational complexity, we convert spectra (sets 
of natural numbers) into languages (over alphabet {0, 1}). The spectrum 
problem can thus be rephrased as: What is the computational complexity 
of the decision problems for spectra? 

Complexity classes. Denote by NTIME(/(n)) (resp. DTIME(/(n))) the 
class of binary languages accepted in time 0(f(n)) by some non-determi- 
nistic (resp. deterministic) Turing machine, where n is the length of the 
input. Similarly, let us denote by DSPACE(/(n)) the class of languages 
accepted in space 0(/(n)) by some deterministic Turing machine. Some 
well-known complexity classes which concern us here are: 

L = DSPACE(logn) C NL = NSPACE(logn) 

LINSPACE = DSPACE(n) C NLINSPACE = NSPACE(n) 

P = (J DTIME(n c ) C NP = (J NTIME(n c ) 

C>1 C>1 

E = |J DTIME(2 c n ) C NE = |J NTIME(2 c ' n ) 

C>1 C>1 

Finally, if C denotes a complexity class, we denote its complement class, 
i.e. the class of binary languages L such that X* — L G C, by coC. 
Of course, the perennial open questions are: 

Open Question 21. (i) Are any of the inclusions 

L C NL, LINSPACE C NLINSPACE, P C NP and E C NE proper? 
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(ii) Do any of the equalities NP = coNP and NE = coNE hold? 

Surprisingly, the following was shown independently by N. Immermann 
and R. Szelepczenyi in 1982, cf. [76]: 

Theorem 5.1 (Immermann, Szelepczenyi 1982). 
NL = coNL and NLINSPACE = coNLINSPACE. 

In Section [6] we shall also make use of the polynomial time hierarchy 
PH and its linear analogue LTH. 

The class Rud lies between L and LINSPACE, and must be different 
from one of them. 

Proposition 5.2. 

(i) (Nepomnjascii 1970, [100]) L C Rud 

(ii) (Wrathall 1978, [132]) Rud = LTH 
(hi) (Myhill 1960, [99]) LTH C LINSPACE 

Open Question 22. 
Are the inclusions L C Rud = LTH C LINSPACE proper? 

Number representations by binary or unary words. It is natural to use 
binary notation for natural numbers (an alternative without leading zeros 
is Smullyan's dyadic notation [121]). The shortest binary length and 
dyadic length of the natural number n are very close to |~log 2 n\ , whereas 
its unary length is of course n, and we have n = 2 log2n . Consequently, 
the same (mathematical) computation that is performed by some Turing 
machine in time eg. 0(2 c 'l n l) when |n| is the binary length of the natural 
number input, is also performed (by a slightly different Turing machine) 
in time 0(n c ) when n is the (unary length of the) natural number input. 

Unary notation (also called tally notation, i.e. the number n is rep- 
resented by the word 1 ... 1 composed of n ones) also has its fans, for 
reasons explained in the description of Fagin's work. Most results in this 
section may be stated in either notation, but for sake of simplicity, and 
unless explicitly stated otherwise, we use binary notation. The length of 
a binary or unary word x is written \x\. 

Recall that Spec denotes the set of spectra of first-order sentences, i.e., 

Spec = {spec(</>) | <j> is a first-order sentence} 

5.2. Spectra, formal languages, and complexity theory. Formal 
language theory was much studied in the early 1960s, cf. [69], in particu- 
lar the Chomsky hierarchy. While the regular and context-free language 
classes were well-understood, several questions remained open for larger 
classes. We need here the following: 
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Theorem 5.3. 

(i) (Ritchie 1963, [110]) £* = LINSPACE 

(ii) (Kuroda 1964, [83]) A language L is context sensitive iff 
L G NLINSPACE. 

For our discussion one should remember that at that time it was then 
(as now) unknown whether LINSPACE = NLINSPACE and also unknown 
whether NLINSPACE was closed under complementation. The latter was 
only resolved positively more than 20 years later, see Theorem 15.11 

These open questions showed a tantalising similarity to Scholz' and 
Asser's questions. If we identify characteristic functions with sets, then 
Bennett's 1962 thesis combined with Asser, Mostowski and Ritchie's re- 
sults, yield 

LINSPACE C Spec C £* and LINSPACE C NLINSPACE C 

? 

This led to a conjecture Spec = NLINSPACE, that spectra might be 
coextensive to the context sensitive languages. The analogy fails, though, 
since more than n "bits of storage" are needed to store an n-element 
model M of a sentence <f>. 

5.3. An early paper. One of the first papers explicitly relating spec- 
tra to bounded resource machine models of computation is [113] (in Ger- 
man), due to Rodding and Schwichtenberg from Miinster in 1972. This 
switch from recursion theory to complexity theory had been prepared 
ten years before by Bennett and Ritchie, and Rodding and Schwichten- 
berg made a step further. The model of computation they use is not 
Turing machines, but register machines. As Bennett does, Rodding and 
Schwichtenberg not only consider spectra of first-order sentences, but also 
higher order spectra, namely spectra of sentences using i-th order vari- 
ables. Let us denote by HO— SPECj the class of spectra of sentences using 
i-th order variables. Let us define the following sequence of functions : 
let expo(n) = n, and expi+x(n) = 2 expi ^ n \ Along with other results in the 
field of recursion theory, Rodding and Schwichtenberg prove the following 
theorem. 

Theorem 5.4 (Rodding and Schwichtenberg 1972 [113]). For alii G N ; 
we have DSPACE(expj(n)) C HO— SPEQ+i. 

In particular, taking i = 0, first-order spectra are thereby shown to con- 
tain DSPACE(n). Let us finally remark that Rodding and Schwichtenberg 
did not consider non-deterministic complexity classes. 

5.4. First-order spectra and non-deterministic exponential time. 
Scholz's original question (see [116]) was finally answered after twenty 
years, when Jones and Selman related first-order spectra to non-determi- 
nistic time bounded Turing machines. Their result was first published in 
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a conference version in 1972 (see [79]), and the journal version appeared 
in 1974 (see [80]). The following theorem holds. 

Theorem 5.5 (Jones and Selman 1972 [79]). Spec = NE. 

This leads to a complexity theory counterpart of Asser's question: 

Corollary 5.6. Spec = coSpec if and only i/NE = coNE. 

They note that this does not answer Asser's question, but it shows the 
link with a wide range of closure under complement questions, in com- 
plexity theory. Presently, we know that many of them are very difficult 
questions. 

Proof ideas. To see that Spec C NE, consider spec(^) G Spec. 
Since the sentence <p is fixed, satisfaction M \= 4> can be decided in time 
that is at most polynomial in the size of model M, where the polynomial's 
degree depends on the quantifier nesting depth in cj). A simple guess- 
and-verify algorithm is: given number x, non-deterministically guess an 
x-element model M., then decide whether M. |= 4> is true. Time and space 
2 c ' n suffice to store an x-element model and check M |= (j), where constant 
c is independent of x and n is the length of x's binary notation. Thus the 
algorithm works in non-deterministic exponential time (as a function of 
input length n). 

To show Spec 5 NE, let Z be a nondeterministic time-bounded Turing 
machine that runs in time 2 c n on inputs of length n. Here the input is 
a word x of length n. We think of binary-coded natural number. 

Computation C can be coded as a word C = config config l . . . config 2 cn 
where config contains the Turing machine's input, and each config t en- 
codes the tape contents and control point at its t-th. computational step. 

Now we have to find a first-order sentence <p such that: 

(i) For every input-accepting Z computation, A4 \= <f> for some model 
M. of cardinality x 

(ii) If Z has no computation that accepts its input, then <p has no model 
of cardinality x 

Each config t length is at most 2 c ' n , so C has length bound 2 c '' n = x c ' 
for c' independent of n. A model Ai of cardinality x contains, for each 
fe-ary predicate symbol P of sentence (j), a relation PC {0,1,... , x — l} k . 
Thus a model can in principle "have enough bits" x k = 2 klogx = 2°^ x ^ 
to encode all the symbols of computation C. 

The remaining task is to actually construct (p so it has a model M. 
of cardinality x if and only if Z has a well-formed computation C that 
accepts input x. In effect, the task is to use predicate logic to check that 
C is well- formed and accepts x. The technical details are omitted from 
this survey paper; some approaches may be seen in [80, 40, 18] H 
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5.5. Relationship to the question P = NP. Let UN = {L\L C 1*} 
be the set of tally languages (each is a set of unary words over the one- 
letter alphabet {1}) and let NPi = NP n UN. Since there is a natural 
identification between NPi and NE, we can deduce that if P = NP, then 
NP = coNP and NE = coNE, i.e. the complement of a spectrum is 
a spectrum. Of course, it also holds that if there is a spectrum whose 
complement is not a spectrum, i.e. if NE ^ coNE, then NP ^ coNP and 
P ^ NP. The converse implication remains open. 

5.6. Independent solutions to Scholz's problem. The character- 
ization of spectra via non-deterministic complexity classes was indepen- 
dently found also by Christen on the one hand and Fagin on the other 
hand during their PhD studies. 

Claude Christen 's thesi^ [18] (1974, ETH Zurich, E. Specker) remains 
unpublished, and only a small part was published in German [19]. Chris- 
ten discovered all his results independently, and only in the late stage of 
his work his attention was drawn to Bennett's work [9] and the paper 
of Jones and Selman [79]. It turned out that most of his independently 
found results were already in print or published by Fagin after completion 
of Christen's thesis. 

Ronald Fagin's thesis (1973, UC Berkeley, R. Vaught) is treasure of 
results introducing projective classes of finite structures, which he called 
generalized spectra (see Subsection \5.7\i that had wide impact on what is 
now called descriptive complexity and finite model theory. Most of our 
knowledge about spectra till about 1985 and, to some extent far beyond 
that, is contained in the published papers (see [41, 43, 42]) emanating from 
Fagin's thesis [40]. In this survey, these papers are pervasive. Right now, 
let us begin with reviewing what is said in [41] about the consequences of 
the complexity characterization of spectra per se. 

Recall that E = (J c >i DTIME(2 c n ) C NE and let us examine the clo- 
sure under complementation problem. Since E = coE, it is clear that if 
a first-order spectrum is in E, then its complement is also a first-order 

spectrum. Of course, the question E = NE is still open. Fagin notes 
that E contains the spectra of categorical sentences, i.e. sentences that 
have at most one model for every cardinality. Thus, one obtains a model 
theoretic question closely related to Asser's question. 

Open Question 23. Is every spectrum the spectrum of a categorical 
sentence ? 

Besides reviewing many natural sets of numbers that are spectra, Fagin 
also proves by a complexity argument the existence of a spectrum S such 



5 Claude Christen, born 1943, joined the faculty of CS at the University of Montreal 
in 1976 and died there, a full professor, prematurely, April 10, 1994. 
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that {n | 2" G S} is not a spectrum (see also [74] for a recent proof by 
diagonalization) . 

5.7. Generalized first-order spectra and NP: Fagin's result. 

Let us spend some time on what is called generalized first-order spectra 
by Fagin in his 1974 paper [41], and is nowadays more usually refered 
to as (classes of finite structures definable in) existential second-order 
logic. Our main goal is to clarify the differences and the connections with 
ordinary first-order spectra. 

In this subsection, we are no longer interested in the size of the finite 
models of some given sentence, but in the models themselves. Hence, let 
a and r be two disjoint vocabularies, and let <p be a first-order a U t- 
sentence. The generalized spectrum of is the class of finite r-structures 
that can be expanded to models of <j>. In other words, it is the class of finite 
models of the existential second-order sentence 3acj) with vocabulary r. 
The vocabulary r is usually refered to as the built-in vocabulary, whereas 
a is often called the extra vocabulary. Note that generalized spectra 
are finite counterparts to Tarski's projective classes (see [124]). Fagin's 
theorem states the equivalence between generalized spectra and classes of 
finite structures accepted in NP. 

Theorem 5.7 (Fagin 1974 [41]). Let r be a non-empty vocabulary. A 
class of finite r -structures K is a generalized spectrum iff K £ NP. 

If the built-in vocabulary r is empty, then the r-structures are merely 
sets. From a computational point of view, it is natural to see such empty 
structures as unary representations of natural numbers. From a logical 
point of view, one obtains ordinary spectra. Hence, Fagin rephrases Jones 
and Selman's complexity characterization of first-order spectra as follows: 

Proposition 5.8. A set S, if regarded as a set of unary words, is a 
first-order spectrum if and only if S £ NPi. 

Concerning the complement problem for generalized spectra, in view 
of Fagin's theorem, it is not surprising that the general case remains 
open. However, the following is known. If a consists of unary predicates 
only, it is called unary. It has been proved in several occasions that 
unary generalized spectra are not closed under complement (see Fagin 
1975 [42], Hajek 1975 [67], Ajtai and Fagin 1990 [3]). For instance, it 
is shown in [42] by a game argument that the set of connected simple 
graphs is not a unary generalized spectrum. In contrast, it is easy to 
design a monadic existential second-order sentence defining the class of 
non-connected simple graphs. 

Since our survey deals with spectra and not with descriptive complexity 
as a whole, we will not say any more on this subject. However, let us note 
that descriptive complexity [76] emerged as a specific field of research out 
of Fagin's paper about generalized spectra. 
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5.8. Further results and refinements. During the late 1970s and 
the 1990s, several results were published that generalize Jones and Sel- 
man's result to higher order spectra on the one hand, and that refine this 
result, in order to obtain correspondences between certain complexity 
classes and the spectra of certain types of sentences. 

In 1977, Lovasz and Gacs [86], it is shown essentially that there are 
generalized first order spectra such that their complement cannot be ex- 
pressed with a smaller number of variables. To do this they introduced 
first order reductions, which became a very important tool in finite model 
theory and descriptive complexity. In fact they were the first to show the 
existence of decision problems which are NP-complete with respect to 
first order reductions. 

First order reductions were used in (un) decidability results early on, 
[123], and more explicitely in [107]. For a systematic survey, see [21, 89]. 
In the context of generalized spectra they were rediscovered independently 
also by Immerman in [75], Vardi in [128] and Dahlhaus in [30]. First order 
reductions are of very low complexity, essentially they are uniform AC 
transductions. The first use of low complexity reduction techniques seems 
to be Jones [78] who termed them log-rudimentary. Allender and Gore 
[4] showed them equivalent to uniform AC reductions. 

Open Question 24. Is there a universal (complete) spectrum So and 
a suitable notion of reduction such that every spectrum S is reducible to 
So? 

Note that this question has two flavors, one where we look at spectra 
in terms of sets of natural numbers, and one where we look at sets of 
(cardinalities of) finite models and their defining sentences. First order 
reductions may be appropriate in the latter case. 

In 1982, Lynch [88] relates the computation time needed to decide prop- 
erty on set of integers to the maximal arity of symbols required in the 
sentence to define this property. Below, we refine the definition of classes 
of spectra in order to take into account some specific syntactic restrictions 
on sentences. 

Definition 5.9. Let d £ N*, the following classes are defined as fol- 
lows. 

(i) SPEC d (resp. F-SPEC d ) is the class of spectra of first-order sentences 
over arbitrary predicate (resp. predicate and function) symbols of 
arity at most d. 

(ii) SPEC(fcV) (resp. SPEC d (/cV)) is the class of spectra of prenex first- 
order sentences involving at most k variables that are all universally 
quantified (resp. and involving predicate symbols of arity at most d). 
The classes F-SPEc(feV) and F-SPEC d (fcV) are analogously defined. 
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(iii) Finally, let SPEC rf (+) be the class of spectra of first-order sentences 
over the language containing one ternary relation interpreted as the 
addition relation over finite segments of N and predicate symbols of 
arity at most d 

Some inclusions between these classes are easy to obtain: more resources 
(in terms of arity or number of variables) means more expressive power. 
Hence, for example, for all i,j E N such that i < j: 

SPEC; C SPECj, F-SPECj C F-SPECj and SPEC; C F-SPECj. 

The following results proves a first relationship between time complexity 
of computation and "syntactic" complexity of definition. 

Proposition 5.10 (Lynch 1982 [88]). 
For all d>l, NTIME(2 d ' n ) C SPEC d (+). 

The converse of this result remains open. It refines the complexity 
characterization of first-order spectra and has many further developments 
that we present in Section [6l From a technical point of view, note that 
Lynch works with so-called "word-models" . Namely, a binary word w with 
length n is seen as a structure with universe {0, . . . , n — 1}, equipped with 
some arithmetics (eg. successor predicate or addition predicate) and with 
a unary predicate that indicates the positions of the digits 1 of w. The 
methods developed in this paper are re-used later on by several authors. 

Open Question 25. Is the inclusion d>l, NTIME(2 dn ) C SPEC d (+) 
proper? 

Finally, Lynch explains that, from an attentive reading of Fagin's proof, 
one can only deduce that if some language L is in NTIME(2 d ' n ), then L 
is in SPEC 2d i.e. is a the spectrum of a first-order sentence involving 
predicates of arity at most 2d. Even though Lynch's result is not an 
exact characterization, but only an inclusion, it has been very influential 
to other researchers. 

In a series of papers published between 1983 and 1990 (see [58, 60, 59, 
61, 62, 64]), Grandjean proposes two fruitful ideas. The first one is to 
use RAM machines as a natural model of computation for general logical 
structures instead of Turing machines, which are best fitted for languages 
(or word structures). The second idea is to remark that the time com- 
plexity seems closely related to the syntactical form of the sentences (and 
more specifically in this case with the number of universally quantified 
variables). Let NRAM(/(n)) be the class of binary languages accepted in 
time 0(f(n)) by a non-deterministic RAM (with successor), where n is 
the length of the input. 
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Theorem 5.11 (Grandjean 1983 [60, 61, 62, 64]). 
For all d > 1, we have NRAM(2 d ' n ) = F-SPEC(ciV) = F-SPEC d (dV). 

The case al = 1 i.e. the case of sentences with one universally quantified 
variable, is more involved: it requires to encode arithmetic predicates such 
as linear order or addition that appear intrinsically in the characterization 
of computation by sentences with one variable. It is developed in the 
papers [62, 64]). In passing, this implies that the presence of the addition 
relation is not mandatory in Proposition 15.101 provided (unary) functions 
are allowed in the language. 

An interesting corollary of the latter characterization is that when the 
number of (universally quantified) variables is fixed, restricting the lan- 
guage to contain function or relation symbols of arity bounded by d only 
does not weaken the expressive power of sentences and define the same 
class of spectra. In other words, the following holds. 

Corollary 5.12 (Grandjean 1983 [60, 61, 62, 64]). For all d > 1, it 
holds that F-SPEC(dV) = F-SPEC d (<iV). 

The original proof of this result relies on complexity arguments based on 
the characterization of f-spec(c?V). We give here a purely logical proof H- 

Proof of Corollary 15.121 For simplicity of notation, we give the 
proof in the case d = 1. Let <p = \/t^ where is quantifier- free and 
whose vocabulary is composed of function symbols of various arities. Let 
Term ('I') be the set of terms and subterms of ^f. The first idea is to 
associate with each element r of Term(^I') a new unary function f T . The 
definition of f T is as follows: 

(i) if r = t or r is a constant symbol, then f T (t) = r, 

(ii) if r = /(ri(t), ... , r^(t)) for some function symbol / of arity k, then 



One obtains a new sentence ^' instead of \& by replacing each term 
t £ Term(fy) by f T (t) in conjunction with the definition of each function 
symbol f T . Let us explain the transformation on some example. 

Let ^ be the following very simple sentence with / of arity 2 and g of 
arity 1: 



fr(t) = f(f T1 (t),... Jr k (t)). 



E(f(f(t,g(t)),t),t) 




Then, corresponds to: 



We thank Etienne Grandjean for kindly giving us this proof. 
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E(f T3 (t)J t (t)) A f^t) = f{f n (t),f t (t)) 

A/r 2 (t) = f(ft(t), / n (t)) A / n (t) = g(f t (t)) A f t (t) = t 

It is easily seen that the only non unary symbols (here /) appear (if 
they do at all) only as an outermost symbol in atomic formula. Let now 
f(o\(t)), . . . , f(Wh(t)) be the list of terms in VI/' involving /. The idea 
is now to replace in VI/' each f(al(t)) by some new term -Fj(i) where Fj 
is of arity one (let's call this new sentence VP") and to write down the 
relations between each pair F^t) and Fj(t) for i,j < h. This provides a 
new sentence tp' = VWi' (VP" A A) where 

A = A (vi(t) = a j (t')^F i (t) = F j (t')) 

The above method shows, when the number of variables is d = 1 how to 
replace h-ary functions by unary functions. However, in order to control 
the definition of the Fis we introduce one additional quantified variable. 
To get rid of this additional variable one can proceed as follows. First, the 
vocabulary is enriched with a binary predicate < interpreted as a linear 
order on the domain, and h unary functions Nj for j < h. Let A' be the 
following sentence. 

A' = 

V(i, t)3(j, x) N(j, x) = (ai(t),i, t)A 

V(j,x) + (h,max)3(j',x') (j',x') = (j,x) + lAN(j,x) < N(j',x')A 
V(j,x) + (h,max)3(j',x')3(i,t)3(i',t') 

(j',x') = (j,x) + lAN(j,x) = (ai(t),i,t)AN(j>,x>) = (a v (1f),i' ',?) 

A{a i {t) = u il {t')^F i {t) = F i ,{t')). 

where V(i, t) stands for Ai<i</i^ anc ^ ^C?> x ) ^ or V \<%<h ^0'> x ) stands 
for Nj(x). Similarly, (j,x) + 1 represents the successor of pair (j,x) in 
the lexicographic ordering of pairs (j, x), j G {1, ... ,h} and x G D. The 
above sentence expresses the fact that the function N (in fact the union of 
functions Nj, j < h) is an increasing bijection from the set {1, . . . ,h}xD 
to the set {(Wi(t), i, t)\i < h,t G D}. This sentence plays the same role 
as the sentence A but this time tuples (ai(t),i,t) with the same first 
component <7,(£) are contiguous in the numbering N. Using a result of 
Grandjean [64], one can replace the linear ordering < by additional unary 
functions. H 
To be complete, one should also mention the earlier (and weaker) result 
obtained by Pudlak [105] by purely logical argument at that time. 

Proposition 5.13 (Pudlak 75 [105]). 
F-SPEC(iiV) C F-SPEC d (2(iV) for all d > 1. 
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In the next section, we will examine more closely the expressive power 
of spectra on restricted vocabulary. The results of this section show that 
a tight connection exists between nondeterministic complexity classes and 
classes of spectra defined by limiting the number of universally quantified 
variables in sentences. A natural question is whether such a connection 
exists when the language itself is limited. In particular 

Open Question 26. Is there a characterization as a complexity class 
of the classes F-SPEC d for all d>l ? 

This question has also some connections with problems addressed in 
Section El 

§6. Approach III: Restricted vocabularies. 

6.1. Spectra for monadic predicates. Maybe the simplest way to 
restrict vocabularies is by limiting the arity of the symbols. In that direc- 
tion, the smallest restricted class of spectra that can be studied is that of 
sentences involving only relation symbols of arity one (so-called monadic 
in the literature). In this case, the following can be proved: 

Proposition 6.1 (Lowenheim 1915 [87], Fagin 1975 [42]). Let r be a 
vocabulary consisting of unary relation symbols only and (j) £ MSOL(r). 
Then the spectrum of 4> is finite or co-finite. 

Proof. Use quantifier elimination or Ehrenfeucht-Frai'sse games. H 
Remark that the even numbers are a spectrum of the following sentence 
with one unary function: 

Vx f(x) /a;A f 2 {x) = x. 

Hence, the most trivial extension of monadic relational vocabulary al- 
ready provide a spectrum which is neither finite nor co-finite. Then, a 
natural question is whether the converse of Proposition 16. II is true or not. 
The following observation can be made by remarking that one can express 
the cardinality of a finite domain set by an existential first-order formula. 

Observation 6.2. Every finite or co-finite set X C N is a first-order 
spectrum for a sentence with equality only (i.e., no relation or function 
symbols). 

This contrast with the fact that every SO-spectrum is also an SO- 
spectrum over equality only. This allows to conclude. 

Proposition 6.3. Ifr consists of a finite (possibly empty) set of unary 
relation symbols, the MSOL(r) -spectra are exactly all finite and cofinite 
subsets of N. 
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6.2. Spectra for one unary function. As remarked above, one unary 
function is enough to define nontrivial spectra. It turns out, however, that 
a complete characterization of spectra for one unary function (with addi- 
tional unary relations) is possible. 

Definition 6.4. A set X C N is ultimately periodic if there are a,p € N 
such that for each n > a we have that re G X iff n+p G J. 

The set of even numbers is ultimately periodic with a = p = 2. Again, 
one may observe 

Observation 6.5. Every ultimately periodic set X C N is a first order 
spectrum for a sentence with one unary function and equality only (this 
is already true if the function is restricted to be a permutation). 

Surprisingly, ultimately periodic sets are precisely the spectra of sen- 
tences with one unary function [34, 66]. 

Theorem 6.6 (Durand, Fagin, Loescher 1997, Gurevich, Shelah 2003). 

Let (f> be a sentence o/MSOL(r) where r consists of 

- finitely many unary relation symbols, 

- one unary function and equality only. 
Then spec(</>) is ultimately periodic, 

Proof. The proof of [34] uses Ehrenfeucht-Frai'sse game argument and 
is restricted to the EMSOL(r) case. The generalization of [66] is an 
application of the Feferman-Vaught-Shelah decomposition method. H 

There exists alternative ways to characterize ultimately periodic sets. 
Among others, they can also be seen as sets of integers definable in Pres- 
burger Arithmetic. Also, since ultimately periodic sets are closed under 
complementation, one have: 

Corollary 6.7. Spectra involving a single unary function symbol are 
closed under complement. 

6.3. Beyond one unary function and transfer theorems. 

There exist several ways to extend a vocabulary with one unary func- 
tion: one may choose to add one (or several) new unary function(s) or, 
in the opposite direction, one may consider vocabularies with only one 
unrestricted binary relation symbol. It turns out that, up to what will be 
called "transfer theorems" in the sequel, both kinds of extension lead to 
very expressive formulas. 

Before going further, notations about classes of spectra need to be re- 
fined to take into account the number of symbols of distinct arities. Again, 
we will distinguish in the sequel whether function symbols are allowed or 
not. We write f-spec with various indices when function symbols are 
allowed, and spec when function symbols are not allowed. 
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Definition 6.8. A set S of integers is in f-spec"' p if there exists a 
first-order sentence 4> such that S = SPEC(0) and the vocabulary of ip 
contains only 

- a function symbols of arity i and 

- at most (3 function symbols of arity less than i, or relation symbols 
of arity less or equal to i. 

Said another way: a set of integers is in spec"'^ if it can be defined 
by a first order formula over the language of a relation symbols of arity i 
and (5 relation symbols of arity less than i. 

When the number of symbols of arity less than i is not restricted, the 
respective class of spectra are denoted by SPEcf'"' and F-SPEcf ,tJ . For 
example, the class of first order spectra over one unary function and 
an arbitrary number of monadic relation and constant symbols (studied 
in Section I6.2[) is denoted f-spec}' w . Similarly SPEC" and F-SPECf are 
abbreviations for spec"' and f-spec"'° Finally, in this setting spec^ 
abbreviates for spec^' (the same holds for F-SPECj). 

Let us examine what are the relations between these different classes of 
spectra. Recall that for all i,j G N such that i < j: 

SPECj C SPEC,, F-SPECj C F-SPEO,- and SPECj C F-SPECj. 
The following inclusions are also easy to see. For all S N, 

SPEC"' W C SPEC, Q+1 and F-SPEC"' W C F-SPEC^ +1 . 

The relationships between spectra of z-ary functions and spectra of i + 
1-ary relations can be made more precise. In [35], it is shown that a 
spectrum of a first-order formula involving any number of unary function 
symbols is also the spectrum of a formula using only one binary relation. 
This can be generalized to any arity. 

Proposition 6.9 ([35, 33]). For every integer i > I, F-SPECj C spec^ + 

The converse is not known not be true. All in once, the following chain 
of inclusions holds. 

SPECj C F-SPECj C F-SPECj C SPECj +1 C SPEC i+1 

It seems difficult to prove the converse of any of the inclusions given 
above. Then, a natural way to study the expressive power of languages 
relatively to spectrum definition is to reduce them through functional 
(here polynomial) transformation. 

Definition 6.10. If / : N — ► N and S is a set of integers, then f(S) = 
{/(n) : n G S} and S l = {n { : n £ S}. 
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Let C\ and C2 be two classes of spectra and / : N — > N be a function. 
A natural question is then the following: 

Let S be a spectrum in C\ and f be a N — > N function, is f(S) a 
spectrum in C2 ? 

In [43] , Fagin showed an interesting equivalence between spectra defined 
by different relational languages. Such results have been called transfer 
theorems since then. The proof can be seen as an extension of the well- 
known interpretation method of Rabin [106] with an additional constraint 
that describes how the domain size of the structure needs to change. 

Theorem 6.11 (Fagin 1975 [43]). For every i > I, S G SPEC 2i <^=^ 
S l g spec 2 . 

Since relational spectra of arity one are finite or cofinite the "transfer" 
theorem above cannot be extended to SPEC-l. Not too surprisingly, if 
function symbols are allowed, a similar and more uniform equivalence 
can be proved. 

Proposition 6.12. For every i > 1, S G F-SPECj <^=^ S l G f-spec-l. 

The flexibility of unary functions as well as their expressive power are 
well emphasized by the following result which show that the image of any 
spectrum under any polynomial transformation is a spectrum involving 
unary functions only, provided the polynomial is "big enough". 

Theorem 6.13 (Durand, Ranaivoson 1996 [35]). Let P(X) G Q[X] of 
degree m > i and with a strictly positive dominating coefficient. Then, 

S G F-SPECi => \P(S)] = {[P(n)l, n G S} G F-SPECx. 

6.3.1. Spectra for one binary relation symbol. An easy consequence of 
Theorem 16.111 is that for every spectrum S, there exists i e N, such 
that S l G SPEC2. This result underlines the great expressive power of 
sentences involving exactly one binary relation symbol. Up to polynomial 
transformation, any spectrum is a spectrum of such a sentence. 

Let BLN C SPEC2 be the set of spectra of a symmetric, irreflexive 
relation (simple graphs). The whole complexity of the spectrum problem 
is already contained in the apparently weaker question of whether BIN 
is closed under complement. 

Theorem 6.14 (Fagin 1974, [41]). BIN is closed under complement 
iff the complement of every first order spectrum is also a spectrum. 

In fact, one can prove the following stronger result: For all X G BIN 
the complement N — X G SPECr, iff the complement of every first order 
spectrum is also a spectrum. 

Open Question 27 (Fagin 1974, [41]). Is every first order spectrum 
in spec3> ? 
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6.3.2. Spectra for two unary functions and more. Here again, Propo- 
sition 16,121 implies that for any spectrum S, there exists an integer % 
such that S % is a spectrum involving unary functions only. This should 
be compared with the very weak expressive power of sentences involving 
unary predicates only. We also know that one unary function leads to the 
very specific class of ultimately periodic sets. The question now is: how 
many unary function symbols are necessary to obtain an expressive (in 
the spectrum framework) fragment of first-order logic. 

Recall that f-spec| denotes the set of first order spectra using at most 
i unary function symbols. Obviously, for all positive integer i, f-spec^ C 
F-SPECj +1 

The inclusion between the two first levels is strict, as shown in [85]. 

Theorem 6.15 (Loescher 1997 [85]). The set {n 2 : n G N} belongs to 
the class F-SPEcf \F-SPEcj, hence the inclusion f-spec] C F-SPEcf is 
proper. 

In fact, more can be proved on the expressive power of sentences with 
two unary functions. 

Theorem 6.16 (Durand, Fagin and Loescher, 1998, [34]). Given k and 
a spectrum S in F-SPEcf. Then kS = {kn : n G S} G F-SPEC^. 

Combined with Proposition 16 . 1 21 or with Theorem l6.13l this implies that 
there is a first order spectrum over two unary function symbols which, 
written in unary, is NP-complete. This also implies that there is a transfer 
result that maps every spectrum to a spectrum over two unary functions. 

6.3.3. Rudimentary sets and spectra of restricted vocabularies. The re- 
lation between rudimentary sets and spectra have been investigated. It 
has been first observed that the set of primes is in f-spec 1 (Grandjean 
1988 [63]). More generaly, it holds that: 

Rud C F-SPECi (Olive 1996 [102]). 

Due to the closure of Rud by polynomial transformation and to the 
existence of the above described transfer results for spectra, it is clear 
that, not only, one can improve: 

Rud C F-SPEcf 

but also 

Rud = Spec if and only if Rud = F-SPEcf 
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However, in view of the following surprising result, there are evidences 
that none of these equalities hold. 

Theorem 6.17 (Woods, 1981, [131]). //Rud = Spec thenNP / coNP 
and NE = coNE 

Since the proof is hardly available and given in a different framework 
in Woods's thesis, we sketch it below. 

Proof. Let LTH the linear time analog of the polynomial hierarchy 
PH. Celia Wrathall gave in [132] a precise complexity characterization of 
rudimentary set by proving that LTH = Rud. 

The following facts are easy to prove. 

(i) LTH C DSPACE(n) C NE 

(ii) If LTH = DSPACE(n) then LTH collapses to some level k. 

(iii) If LTH collapses to some level k then PH collapses to some level k'. 

(iv) LTH = DSPACE(n) implies PH = PSPACE 

If Rud = Spec then, since Spec = NE, it holds LTH = DSPACE(n) = 
NE. Hence, both LTH and PH collapse. 

For the other consequence, suppose NP = coNP. In this case, the 
polynomial hierarchy collapses to NP i.e. NP = PH. If again Rud = 
Spec one knows than LTH = DSPACE(n) = NE and PH = PSPACE. 
Then, since NP C NE one obtains PSPACE = PH C DSPACE(n) which 
contradict the well-known results DSPACE(n) C PSPACE. H 

Hence, although the equality Rud = F-SPEcf might seem realistic at 
first sight, its consequences makes it probably hard to prove. 

6.4. The unary and the arity hierarchies. The results of the pre- 
ceding section show that for any spectrum S, there is a polynomial P 
such that P{S) is the spectrum of a first order sentence with two unary 
functions. This underlines the expressive power of this latter class of 
spectra up to polynomial transformation. However, as we know equality 
between particular classes of spectra defined, for example, by restriction 
on the number or the arity of the predicates in the languages is often 
an open problem. Taking a very particular case, it is even not known 
whether three unary functions "say" more than two as far as spectra are 
concerned. This leads to the following open problems about spectra of 
unary functions. 

Open Question 28 (The unary hierarchy). Is the following hierarchy 
proper: 

F-SPECJ C F-SPECi C F-SPEcf C . . . C F-SPECi C . . .? 

Open Question 29. Is Rud = f-spec^ ? 

Open Question 30. Is spec?, = f-spec-l or even spec.], = f-spec^ ? 
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Both positive or negative answers to these questions would have non- 
trivial consequences. For example, proving Rud 7^ f-spec^ would sepa- 
rate the classes Rud and Spec. In the opposite, we already know that 
Rud = f-spec^ implies NP ^ coNP and NE = coNE. 

Proving that f-spec^ C f-specJ 1 " 1 " 1 for some integer k would also sep- 
arate Spec from Rud. Similarly, a collapse of the unary hierarchy to 
some level k would have strong consequences. It is easily seen that, test- 
ing if a number n (as input i.e. in unary) is in the spectrum of a first 
order sentence over k unary functions can be decided by a deterministic 
polynomial time RAM algorithm that uses k ■ n additional non deter- 
ministic steps. Since from Theorem 15.111 NRAM(2 n ) C f-speC-l then, 
the inclusion f-spec 1 C f-spec^ would imply immediately the follow- 
ing "trade-off" result on nondeterministic RAM computations (see [34]): 
for any arbitrary constant c, any nondeterministic RAM, which given a 
number n as input, runs in time c ■ n can be simulated by a RAM which 
runs in k ■ n nondeterministic steps and in a polynomial number of de- 
terministic steps. For c greater than k (which is fixed) such a result is 
rather unexpected and would strongly modify our understanding of the 
relationships between determinism and nondeterminism. 

Another natural question concerns the relative expressive power of first 
order sentences defined by restriction on the arity of the symbols involved 
in the language. It is open whether any spectrum is the spectrum of a 
sentence over one binary relation only. The question may be refined as 
follows (See Fagin [43, 44]). 

Open Question 31 (The arity hierarchy). Is the following hierarchy 
proper: 

SPECj^ C SPEC 2 C SPEC 3 C . . . C SPEC fe c . . .? 

The same question could be asked for spectra over i-ary functions. 

Although the above problem is still open, Fagin proved the following 
partial result. 

Theorem 6.18 (Fagin 1975 [43]). If spec k = SPEC fc+1 for some inte- 
ger k then, the arity hierarchy collapses and SPEC fc = SPEC m for every 
m > k. 

6.4.1. Collapse of hierarchies and closure under functions. Let C be a 
class of spectra and / be a N —* N function. Class C is closed under / 
if for any spectrum S in C, f(S) is in C. In the spirit of Theorems 16.111 
and 16.131 and Proposition 16.121 one can relate the collapse of hierarchies 
to the possible closure of class of spectra under some function. In [73], 
such problems are studied and several related results are given. 

Theorem 6.19. 
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(i) (Hunter, [73]) The arity hierarchy collapses to SPEC2 if and only if 
SPEC2 is closed under function / :rh ■ 

(ii) The unary hierarchy collapses to F-SPEcf if and only if F-SPEcf is 
closed under function /:ni-» [5] . 

Proof of (ii). In [73], a result similar to (ii) is proved about the num- 
ber of binary predicates instead of the number of unary functions. It is 
not hard to see that his result extends to the case of (ii). H 

6.5. Higher order spectra. In [88], Lynch also proposes a general- 
ization of the characterization given in Proposition 15. 101 to higher order 
spectra. It is shown that the polynomial time hierarchy corresponds to 
second-order logic. In other words, let PHi = PH n UN then, a set 
X C N is a second order spectrum if and only if {1™ : n G X} G PHj. 

Consider SO& the class of second-order formulas where all the second 

vk k 

order variables are of arity at most k. Let 2 = {2 n G N : n G X} for 
ICE The following precise characterization can be obtained. 

Theorem 6.20 (More and Olive 1997). [96] A set X is a spectrum of 
a sentence in SOfc iff ' 2 X is rudimentary. 

6.6. Spectra of finite variable logic FOL fc . We denote by FOL fc 
first order logic with only k distinct variables (bound or free), and by 
SpecFOh k the set of spectra of sentences of FOL fc . FOL fc has been 
studied extensively in finite model theory, [103, 56], but somehow the 
spectrum problem was not adressed for FOL fc in the literatur^). 

With the same proof as for Proposition 16.11 one gets easily the following. 

Proposition 6.21. The spectrum of a sentence in FOL 1 is finite or 
cofinite. 

Let M be a set of natural numbers. Recall that a gap of M is a pair of 
integers g\ , #2 such that g\ , gi G M but for each n with g\ < n < we 
have that n M. We define Tow(k,n) inductively: Tow(0,n) = n and 
Tow(k + l,n) = 2 Tow ^ n \ 

By coding structures which model iterated powersets one gets immedi- 
ately the following. 

Proposition 6.22. For every k there is a (f) G FO A such that the spec- 
trum sp{4>) contains gaps of size Tow(k,n). 

Four variables are used here to express extensionality of the membership 
relation, and closure under unions with singletons. 
Working a bit harder one can prove the following^). 

7 While the third author was lecturing in Chennai in January 2009 on the spectrum 
problem, Dr. S. P. Suresh asked about it. The results below are the fruit of discussions 
with Amaldev Manuel and Martin Grohe. 

8 M. Grohe, personal communication 
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Theorem 6.23 (Grohe). (i) For every Turing machine T there is a 
sentence 4>T £ FOL 3 such that 

s P(j> T = {t 2 '■ T has an accepting run of length t} 

(ii) For every recursive function f there is a sentence <fif in FOL 3 such 
that the gaps in the spectrum of <f>$ grow faster than f . 

The key idea is to encode Turing machines on grids, hence the t 2 in the 
statement of the Theorem. 

In contrast to the above, it follows from the proof that FOL 2 has 
the finite model property and hence is decidable, [56], that the gaps are 
bounded. 

Theorem 6.24. For every (ft £ FOL 2 the spectrum sp{4>) has gaps of 
size at most 2P°^W . 

Corollary 6.25. The following inclusions are proper: 
SpecFOL 1 C SpecFOL 2 C SpecFOL 3 

Open Question 32 (Finite Variable Hierarchy). Does the hierarchy SpFOL k 
collapse at level 3 ? 

§7. The Ash conjectures. The notion of /c-equivalence of structures 
was introduced by Frai'sse [50] in 1954 and the game presentation is due 
to Ehrenfeucht [37] in 1961. 

Definition 7.1. Let a be a vocabulary, let A and B be a-structures 
and let k be an integer. The two structures A and B are /c-equivalent if 
and only if they satisfy the same c-sentences with quantifier depth < k. 

For a detailed presentation of ^-equivalence, we refer the reader to [36, 
72]. For our purpose, the two most important features of the above notion 
are the following. For each finite vocabulary a and for each quantifier 
depth k, the number of /c-equivalence classes of cr-structures is finite and 
we denote it by M a ^. For each finite vocabulary c, for each quantifier 
depth k and for each /^-equivalence class of c-structures C, there exists a 
cr-sentence H>c of quantifier depth k such that, for all cr-structure A, we 
have A G C if and only if A |= ^c- 

In 1994, Ash [6] introduces a counting function relative to the ^-equivalence 
classes: 

Definition 7.2. Let a be a finite relational vocabulary, and let be a 
positive integer. Ash's function counts, for each positive integer n, 
the number of /c-equivalence classes of u-structures of size n. 

This function is obviously bounded by the total number of classes, M^, 
and Ash's conjecture deals with its asymptotic behavior. 
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Open Question 33 (Ash's constant conjecture). Is it true that for any 
finite relational vocabulary a and any positive integer k, the Ash function 
Na- t k is eventually constant? 

A weaker version of his conjecture is also proposed: 

Open Question 34 (Ash's periodic conjecture). Is it true that for any 
finite relational vocabulary a and any positive integer k, the Ash function 
iV^fc is eventually periodic? 

Ash shows by a very neat proof that both conjectures imply the spec- 
trum conjecture (i.e. Spec = coSpec). Let us present the idea of the 
proof. Fix a and k and assume for simplicity that N a ^{n) = a for all n. 
Take a quantifier depth k first-order d-sentence (p. We exhibit a quanti- 
fier depth k first-order cr'-sentence 9 a such that SPEC(# a ) = N + \spec(</>), 
where a' consists in a disjoint copies of a. Let Ci, . . . , Cm ct k be the classes 
of fc-equivalence of cr-structures. All the structures in a given class Ci 
agree on <p, i.e. either \/A G Ci A |= <p or V.4 G Ci A |= -«fi, because <f> 
has quantifier depth k. Form the set X a consisting of all sets of a distinct 
CiS such that \/A G Ci A |= -></>. Take 9 a = \J Y ex a AceY *c> where $>' c 
characterizes C and is written using a distinct copy of a. The sentence 9 a 
has quantifier depth k and n G SPEC(# a ) if and only if there are (at least) 
a distinct /c-equivalence classes containing a structure A with size n and 
A |= —>4>. Since the total number of /c-equivalence classes containing a 
structure A with size n is exactly a, there is no class left for a model of 
4>, and the anounced result follows. 

These ideas of Ash's have not been exploited afterwards, and his paper 
has remained isolated until recently. In 2006, Chateau and More [15] 
published a second paper related to Ash's counting functions. For all 
i G N + , note N~l(i) = {n G N+IA^^n) = i}, the inverse image of the 
positive integer i under the function N a j.. Both Ash's conjectures can 
be rephrased in terms of the sets N~^.(i) and are subsumed under the 
following condition: 

Open Question 35 (Ultra- weak Ash conjecture). Is it true that for 
any finite relational vocabulary a, for any positive integer k and for all 
i G N + , the set N~^{i) is a spectrum? 

This last conjecture is proved to be a necessary and sufficient condition 
for the complement of a spectrum to be a spectrum. 

Theorem 7.3. Let a be a finite relational vocabulary, and let k be a 
positive integer. For all i G N + , the set N~^{i) is a spectrum if and only 
if for every a -sentence <p of quantifier depth < k, the set N + \ SPEC((/?) is 
a spectrum. 
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All in all, the spectrum conjecture is true if and only if the ultra-weak 
Ash conjecture is true. 

Note that, in some cases, one gets interesting additional information 
about complements of spectra. Eg., if the Ash function is eventually- 
constant, then for every cr-sentence <p of quantifier depth < k, the set 
N + \spec(<^) is the spectrum of a sentence of the same quantifier depth 
as ip, over a vocabulary with the same arities as a. 

In order to make some progress, particular cases of Ash conjectures 
may be considered, i.e., by restricting the the sets of pairs (a, k). In his 
original paper, Ash [6] already did so. For instance, he shows that if a is 
a unary vocabulary, then for all k > 1, Ash's function N a ^ is eventually 
constant. In [15], it is also proved that for all finite relational vocabulary 
a, Ash's function N a ^ is eventually constant. However, these results are 
of very limited interest because unary vocabularies or quantifier depth 
two (see [97]) only allow to define finite and cofinite spectra. 

In other cases, solving restricted versions of Ash conjectures happens 
to be as difficult as solving the full conjectures. Let Q be a vocabulary 
restricted to a single binary relation symbol (i.e. the language of graphs). 
As already said, Fagin [43] shows that, up to a polynomial padding, every 
spectrum is a spectrum in SPEC?,. Since spectra are closed under inverse 
images of polynomial, one obtains: 

Proposition 7.4. If for all k > 1 and for all i G N + , the set Ng\(i) 
is a spectrum, then the (full) spectrum conjecture holds. 

It is less known that, as an easy corollary of a result of Grandjean in [64], 
up to a polynomial padding, every spectrum is the spectrum of a quantifier 
depth 3 sentence, using an unbounded number of binary relations. Once 
again, it follows: 

Proposition 7.5. If for all binary vocabulary a and for all i £ N + , the 
set N~l(i) is a spectrum, then the spectrum conjecture holds. 

Since there is no known padding result that uses a finite set of pairs 
(a, k), these results are presently the best possible. 

In order to make further progress in solving particular cases of Ash 
conjectures, Chateau and More introduce a new type of restriction, con- 
cerning the semantics of the vocabularies. 

Definition 7.6. Let a be a finite relational vocabulary, and let C be a 
class of finite u-structures. For any positive integer k, the Ash function 
for the class C, denoted by A^fc, counts the number (< M a ^) of non 
fc-equivalent structures in C of size n for all n > 1. 

Let B be the set of finite Boolean algebras. Clearly, we have: Ng ^(1) = 
{2 a e N | a > 2}, AT-fc(O) = N+ \ {2 a G N | a > 2} and N^ k (i) = 
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for i {0, 1}. From this example, it follows that we cannot expect Ash's 
constant or periodic conjectures to hold for classes of structures. A natural 
question arises: "Which functions can be Ash's function for some class of 
structures?" . Let / : N + i-> N a function bounded by some constant Mf 
and computable in E. Then, it is proved that there exist a vocabulary 
a, a d-sentence ip and a quantifier depth K such that / = N Modf ^^ K , 
where Mod / (tp) denotes the class of finite models of ip. 

Now, let us turn to the ultra-weak Ash conjecture for classes of struc- 
tures. Consider the class Modf(^) of finite models of a first-order sen- 
tence 

Open Question 36 (Ultra- weak Ash conjecture for classes of structures) . 
Is it true that for any finite relational vocabulary a, for any first-order a- 
sentence ^ , for any positive integer k and for all i G N, the set ^ k (i) 

is a spectrum? 

Only somehow expressive classes of structures are interesting, and in 
particular it is natural to require that Modf{$>) contains at least one 
structure of size n, for all positive integer n. 

Theorem 7.7. Let o be a finite relational vocabulary, let ^ be a first- 
order o~ -sentence such that Modf(^) contains at least one structure of 
size n, for all positive integer n and let k be a positive integer. For all 
i G N + , the set N^~ od ^ k (i) is a spectrum if and only if for every a- 
sentence ip of quantifier depth < k which implies ^ (i.e. every model of 
ip is also a model of \I> ), the set N + \ SPEC(</?) is a spectrum. 

In some particular cases, we obtain more information about comple- 
ments of spectra. For instance, if VP expresses that all binary relations 
in o~ are functional (which can be done using a quantifier depth 3 sen- 
tence), and if k > 3, then all binary relations in the sentence defining 
N + \ SPEC(</?) are functional too. 

Let us turn to classes of structures for which the study of Ash's functions 
is as difficult as the general case. More precisely, let us consider the 
following classes of structures: let SQ be the class of simple graphs; let 
2£ be the class of two equivalence relations and let IT be the class of two 
functions. 

Proposition 7.8. Let C e {SQ, 2£, 2J 7 }. If for all positive integer k 
and for all i G N + ; the set N^(i) is a spectrum, then the spectrum 
conjecture holds. 

Once again, this is a consequence of various padding results. Fagin 
actually proves in [43], that simple graphs allow a polynomial padding 
for all spectra. It is also the case for two unary functions, as proved 
by Durand, Fagin and Loescher in [34]. The last result concerning two 
equivalence relations is proved in [14, 38]. 
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§8. Approach IV: Structures of bounded width. 

8.1. Prom restricted vocabularies to bounded tree-width. In 

this section we look again at spectra of sentences with one unary func- 
tion and a fixed set of unary predicates. The finite structures which have 
only one unary function consist of disjoint unions of components of the 
form given in Figure [TJ They look like directed forests where the roots 




Figure 1. Models of one unary function 



are replaced by a directed cycle. The unary predicates are just colours 
attached to the nodes. The similarity of labeled graphs to labeled trees 
can be measured by the notion of tree-width, and in fact, these structures 
have tree width at most 2. Inspired by Theorem 16.61 E. Fischer and J. A. 
Makowsky [49] generalized Theorem 16.61 by replacing the restriction on 
the vocabulary by a purely model theoretic condition involving the width 
of a relational structure. In this section we discuss their results. 

8.1.1. Tree-width. In the eighties the notion of tree-width of a graph 
became a central focus of research in graph theory through the monu- 
mental work of Robertson and Seymour on graph minor closed classes of 
graphs, and its algorithmic consequences [111]. The literature is very rich, 
but good references and orientation may be found in [31, 10, 11]. Tree- 
width is a parameter that measures to what extent a graph is similar to 
a tree. Additional unary predicates do not affect the tree-width. Tree- 
width of directed graphs is defined as the tree-width of the underlying 
undirected graphH. 

Definition 8.1 (Tree-width). A fc-tree decomposition of a graph G = 
(V, E) is a pair {{Xi | i G I},T = (I,F)) with {Xi \ i G 1} a family of 
subsets of V, one for each node of T, and T a tree such that 

(i) U i6 /*i = V. 

(ii) for all edges (v, w) G E there exists ami £ I with v G Xi and w G Xj. 

(iii) for all i, j, k G I: if j is on the path from i to k in T, then Xi n X^ C 
Xj in other words, the subset {t \ v G Xt} is connected for all v. 

9 In [77] a different definition is given, which attempts to capture the specific situation 
of directed graphs. But the original definition is the one which is used when dealing 
with hypergraphs and general relational structures. 
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(iv) for all i G I, \Xi\ < k + 1. 

A graph G is of tree-width at most k if there exists a A;-tree decomposition 
of G. A class of graphs K is a TW{k)-c\ass iff all its members have tree 
width at most k. 

Given a graph G and k G N there are polynomial time, even linear 
time, algorithms, which determine whether G has tree-width k, and if the 
answer is yes, produce a tree decomposition, cf. [11]. However, if k is 
part of the input, the problem is NP-complete [5] 



Graph G Tree decomposition of G 




Figure 2 . A graph and one of its tree-decompositions 

Trees have tree- width 1. The clique K n has tree- width n — 1. Fur- 
thermore, for fixed k, the class of finite graphs of tree-width at most k 
denoted by TW(k), is MSOL-definable. 

Example 8.2. The following graph classes are of tree-width at most k: 

(i) Planar graphs of radius r with k = 3r. 

(ii) Chordal graphs with maximal clique of size c with k = c — 1. 

(iii) Interval graphs with maximal clique of size c with k = c — 1. 

Example 8.3. The following graph classes have unbounded tree-width 
and are all MSOL-definable. 

(i) All planar graphs and the class of all planar grids G m>n . 

Note that if n < uq for some fixed no G N, then the tree-width of 
the grids G mtn ,n < no, is bounded by 2no- 

(ii) The regular graphs of degree r, r > 3 have unbounded tree-width. 

Tree-width for labeled graphs can be generalized to arbitrary relational 
structures in a straightforward way. Clause (ii) in the above definition is 
replaced by 
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(ii-rel): For each r-ary relation R, if v G R, there exists an i G / with 
v G XT. 

This was first used in [45]. 

It is now natural to ask, whether Theorem 16.61 can be generalized to 
arbitrary vocabularies, provided one restricts the spectrum to structures 
of fixed tree-width k. Indeed, E. Fischer and J. Makowsky [49] have 
established the following: 

Theorem 8.4 (E. Fischer and J.A. Makowsky (2004)). 
Let (j) be an MSOL sentence and k G N. Assume that all the models of <fi 
are in TW{k). Then spec(cf)) is ultimately periodic. 

8.2. Extending the logic. First one observes that the logic MSOL 
can be extended by modular counting quantifiers Cfc jm , where C\^ m x <f>(x) 
is interpreted as "there are, modulo m, exactly k elements satisfying 
<j>(x) n . We denote the extension of MSOL obtained by adding, for all 
k,m G N the quantifiers Cfc jm , by CMSOL. 

Typical graph theoretic concepts expressible in FOL are the presence 
or absence (up to isomorphism) of a fixed (induced) subgraph H, and 
fixed lower or upper bounds on the degree of the vertices (hence also r- 
regularity). Typical graph theoretic concepts expressible in MSOL but 
not in FOL are connectivity, ^-connectivity, reachability, /c-color ability 
(of the vertices), and the presence or absence of a fixed (topological) mi- 
nor. The latter includes planarity, and more generally, graphs of a fixed 
genus g. Typical graph theoretic concepts expressible in CMSOL but 
not in MSOL are the existence of an eulerian circuit (path), the size of 
a connected component being a multiple of k, and the number of con- 
nected components is a multiple of k. All the non-definability statements 
above can be proved using Ehrenfeucht-Frai'sse games. The definability 
statements are straightforward. 

Second, one observes that very little of the definition of tree-width is 
used in the proof. The techniques used in the proof of Theorem EH can be 
adapted to other notions of width of relational structures, such as clique- 
width, which was introduced first in [25] and studied more systematically 
in [27], and to patch-width, introduced in [49]. 

8.3. Ingredients of the proof of Theorem 18.41 To prove Theorem 
18.41 and its generalizations below, the authors use three tools: 

(i) A generalization of the Feferman-Vaught Theorem for &-sums of la- 
beled graphs to the logic CMSOL, due to B. Courcelle, [23], and 
further refined by J.A. Makowsky in [89]. 

(ii) A reduction of the problem to spectra of labeled trees by a technique 
first used by B. Courcelle in [24] in his study of graph grammars. 

(iii) An adaptation of the Pumping Lemma for labeled trees, cf. [52] . 
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The proof of Theorem 18,41 is quite general. Its proof can be adapted to 
stronger logics and other notions of width of relational structures. How- 
ever, the details are rather technical. 

8.4. Clique- width. A fc-coloured r-structures is a = rU{Pi, . . . , Pk}- 
structure where Pi , i < k are unary predicate symbols the interpretation 
of which are disjoint (but can be empty). 

Definition 8.5. Let 21 be a fc-coloured r-structure. 

(i) (Adding hyper-edges) Let R £ r be an r-ary relation symbol. 
r ]R,p jl ,... ,Pj r (21) denotes the /c-coloured r structure 23 with the same 
universe as 21, and for each S £ r^, S ^ R the interpretation is also 
unchanged. Only for R we put 

R B = R A u{a G A r : a, L £ P£}. 

We call the operation rj hyper edge creation, or simply edge creation 
in the case of directed graphs. In the case of undirected graphs we 
denote by f]Pj 1 ,P J2 the operation of adding the corresponding undi- 
rected edges. 

(ii) (Recolouring) Pij(2l) denotes the fe-coloured r structure 23 with the 
same universe as 21, and all the relations unchanged but for P A and 
Pf. We put 

Pf = and Pf = Pf U P, A . 

We call this operation recolouring. 

(iii) (modification via quantifier free translation) More generally, for S 6 
Tk of arity r and B(xi, . . . quantifier free T^-formula, 5s,b(%) 
denotes the /c-coloured r structure 23 with the same universe as 21, 
and for each S' € r^, S' ^ S the interpretation is also unchanged. 
Only for S we put 

S B = {a£A r :a£ B A }. 

where B A denotes the interpretation of B in 21. 

Note that the operations of type p and r\ are special cases of the operation 
of type 5. 

Definition 8.6 (Clique- Width, [27, 89]). 

(i) Here r = {Re} consist of the symbol for the edge relation. Given 
a graph G = (V, E), the clique-width of G (cwd(G)) is the min- 
imal number of colours required to obtain the given graph as an 
{i?s}-reduct from a /c-coloured graph constructed inductively from 
coloured singletons and closure under the following operations: 

(i.a) disjoint union (U) 

(i.b) recolouring (pi^j) 

(i.c) edge creation {rjE^p^ 
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(ii) For r containing more than one binary relation symbol, we replace 
the edge creation by the corresponding hyper edge creation ^_r,p Ji ,p jr 
for each R G r. 

(hi) A class of r-structures is a CW(fc)-class if all its members have clique- 
width at most k. 

If t contains a unary predicate symbol U, the interpretation of U is not 
affected by the operations recoloring or edge creation. Only the disjoint 
union affects it. 

A description of a graph or a structure using these operations is called 
a clique-width parse term (or parse term, if no confusion arises). Every 
structure of size n has clique-width at most n. The simplest class of 
graphs of unbounded tree-width but of clique-width at most 2 are the 
cliques. Given a graph G and k € N, determining whether G has clique- 
width k is in NP. A polynomial time algorithm was presented for k < 3 
in [22]. 

It was shown in [47, 46] that for fixed k > 4 the problem is NP- 
complete. The recognition problem for clique-width of relational struc- 
tures has not been studied so far even for k = 2. The relationship between 
tree- width and clique- width was studied in [27, 54, 2]. 

Theorem 8.7 (Courcelle and Olariu (2000)). Let K be a TW(k)-class 
of graphs. Then K is a CW(m)-class of graphs with m < 2 k+1 + 1. 

Theorem 8.8 (Adler and Adler (2008)). For every non-negative inte- 
ger n there is a structure 21 with only one ternary relation symbol such 
that 21 G TW(2) and 21 CW(n). 

The following examples are from [92, 55]. 

Example 8.9 (Classes of clique-width at most k). 

(i) The cographs with k = 2. 

(ii) The distance- hereditary graphs with k = 3. 

(iii) The cycles C n with k = 4. 

(iv) The complement graphs C n of the cycles C n with k = 4. 

The cycles C n have tree-width at most 2, but the other examples have 
unbounded tree-width. 

Example 8.10 (Classes of unbounded clique- width) . 

(i) The class of all finite graphs. 

(ii) The class of unit interval graphs. 

(iii) The class of permutation graphs. 

(iv) The regular graphs of degree 4 have unbounded clique-width. 

(v) The class grids Grid, consisting of the graphs Grid nxn . 

For more non-trivial examples, cf. [92, 55]. In contrast to TW(k), we 
do not know whether the class of all CW(fc)-graphs is MSOL-definable. 
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To find more examples it is useful to note, cf. [90] : 

Proposition 8.11. If a graph is of clique-width at most k and G' is an 
induced subgraph of G, then the clique-width of G' is at most k. 

In [49] the following is shown: 

Theorem 8.12 (E. Fischer and J.A. Makowsky (2004)). 
Let 4> £ CMSOL(t) be such that all its finite models have clique-width 
at most k. Then there are mo,no £ N such that if <j> has a model of size 
n > uq then <f> has also a model of size n + rriQ . 

From this we get immediately a further generalization of Theorem 18.41 

Corollary 8.13. Let (p £ CMSOL(r) be such that all its finite models 
have clique-width at most k. Then spec(4>) is ultimately periodic. 

8.4.1. Patch-width. Here is a further generalization of clique-width for 
which our theorem still works. The choice of operation is discussed in 
detail in [26]. 

Definition 8.14. Given a r-structure 21, the patch-width ofG (pwd(G)) 
is the minimal number of colours required to obtain & as a {r}-reduct 
from a ^-coloured r-structure inductively from fixed finite number of re- 
structures and closure under the following operations: 

(i) disjoint union (U), 

(ii) recoloring (pi->j) and 
(hi) modifications (6s,b)- 

A class of r-structures is a PW T (k)-c\ass if all its members have patch- 
width at most k. 

A description of a r-structure using these operations is called a patch 
term. 

Example 8.15. 

(i) In [27] it is shown that if a graph G has clique-width at most k then 
its complement graph G has clique-width at most 2k. However, its 
patch- width is also k as G can be obtained from G by §e,^e- 

(ii) The clique K n has clique-width 2. However if we consider graphs 
as structures on a two-sorted universe (respectively for vertices and 
edges), then K n has clique-width c(n) and patch- width p(n) where 
c(n) and p{n) are functions which tend to infinity. This will easily 
follow from Theorem 18.201 For the clique-width of K n as a two- 
sorted-structure this was already shown in [115]. 

Remark 8.16. In [26] it is shown that a class of graphs of patch-width 
at most k is of clique-width at most f(k) for some function /. It is shown 
in [48] that this is not true for relational structures in general. 
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In the definition of patch-width we allowed only unary predicates as 
auxiliary predicates (colours). We could also allow r-ary predicates and 
speak of r-ary patch-width. The theorems where bounded patch-width is 
required are also true for this more general case. The relative strength of 
clique- width and the various forms of patch-width are discussed in [48] . 

In [49] the following is shown: 

Theorem 8.17 (E. Fischer and J.A. Makowsky (2004)). 
Let 4> & CMSOL(r) be such that all its finite models have patch-width at 
most k. Then there are mo,no £ N such that if <f> has a model of size 
n > uq then <f> has also a model of size n + tjiq . 

From this we get yet another generalization of Theorem 18.41 

Corollary 8.18. Let (p e CMSOL(r) be such that all its finite models 
have patch-width at most k. Then spec{4>) is ultimately periodic. 

More recent work on spectra and patch-width may be found in [117, 32]. 

8.5. Classes of unbounded patch-width. Theorem 18.171 gives a 
new method to show that certain classes K of graphs have unbounded 
tree-width, clique-width or patch-width. 

To see this we look at the class Grid of all grids Grid nxn - They are 
known to have unbounded tree-width, cf. [31], and in fact, every minor 
closed class of graphs of unbounded tree-width contains these grids. They 
were shown to have unbounded clique-width in [55]. However, for patch- 
width these arguments do not work. On the other hand Grid is M SO In- 
definable, and its spectrum consists of the numbers n 2 , so by Theorems 
18.121 and 18.171 the unboundedness follows directly. 

In particular, as this is also true for every K' D K, the class of all 
graphs is of unbounded patch-width. 

Without Theorem l8.17l there was only a conditional proof of unbounded 
patch-width available. It depends on the assumption that the polynomial 
hierarchy S p does not collapse to NP. The argument then proceeds as 
follows: 

(i) Checking patch-width at most /c of a structure 21, for k fixed, is in 
NP. Given a structure 21, one just has to guess a patch-term of size 
polynomial in the size of 21. 

(ii) Using the results of [89] one gets that checking a CMSOL(r)-property 
<j> on the class PW T (k) is in NP, whereas, by [91], there are S^-hard 
problems definable in MSOL for every level S p of the polynomial 
hierarchy. 

(iii) Hence, if the polynomial hierarchy does not collapse to NP, the class 
of all r-structures is of unbounded patch-width, provided r is large 
enough. 
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Open Question 37. What is the complexity of checking whether a r- 
structure 21 has patch-width at most k, for a fixed k ? 

8.6. Parikh's Theorem. R. Parikh's celebrated theorem, first proved 
in [104], counts the number of occurences of letters in A;- letter words 
of context-free languages. For a given word w, the numbers of these 
occurences is denoted by a vector n(w) G N fe , and the theorem states 

Theorem 8.19 (Parikh 1966). For a context-free language L, the set 
Par(L) = {n(w) G N k : w G L} is semilinear. 

A set X C N s is linear in W iff there is vector a G N s and a matrix 
M G W xr such that 

X = A n U = {b G N s : there is u G N r with b = a + M ■ u} 

Singletons are linear sets with M = 0. If M / the series is nontrivial. 
X C N s is semilinear in W iff X is a finite union of linear sets Ai CN S . 
The terminology is from [104], and has since become standard terminology 
in formal language theory. We note that for unary languages, Parikh's 
Theorem looks at the spectrum of context-free languages. 

B. Courcelle has generalized Theorem l8 . 1 91 further to context-free vertex 
replacement graph grammars, [24]. We want to generalize Theorem 18.191 
to spectra. Rather than counting occurences of letters, we look at many- 
sorted structures and the sizes of the different sorts, which we call many- 
sorted spectra. In [49] the following theorem is proved: 

Theorem 8.20 (E. Fischer and J.A. Makowsky (2006)). 
Let K be a class of CMS OL -definable many-sorted relational structures 
which are of patch-width at most k. Then the many-sorted spectrum of K 
forms a semilinear set. 

Appendix A. A review of some hardly accessible references. 

This section contains a detailed presentation of the material of Subsec- 
tion 231 Note that the proofs sketched here do not necessarily correspond 
to the original proofs. 

A.l. Asser's paper. In chronological ordering, the first paper related 
to spectra is [7], in German, due to Asser in 1955. Though it does not use 
the name "spectrum", nor refer to Scholz in its title or in the text, the 
long introduction clarifies the context in which the concept of spectrum 
was born. The author addresses the general question of classes of cardi- 
nal numbers (not only natural numbers) so-called "represent able" by a 
sentence of first-order logic with equality, both in the framework of satisfi- 
ability theory and validity theory. Here a first-order sentence cp represents 
a given (finite or infinite) cardinality m regarding satisfiablity if there is 
a structure which domain has cardinality m which is a model of ip (i.e. 
for finite m, it means m G spec (</?)), and regarding validity, if (p holds in 
every structure with cardinality m. Asser first notices that tp represents 
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m regarding satisfiability if and only if —xp does not represent m regarding 
validity, so that validity reduces to satisfiability via complement. Then, he 
remarks that, from Lowenheim-Skolem Theorem [87, 118], the represen- 
tation question in satisfiability theory for infinite cardinalities is trivial: 
the first-order sentence ip either has no infinite model (and in this case it 
has finite models in finitely many finite cardinalities only) or has models 
in every infinite cardinality Hence, the problem actually is about exactly 
which sets of natural numbers are the set of cardinalities of finite models 
of first-order sentences, i.e. what we would call spectra. In a footnote, 
one reads "this question was also asked by Scholz as a problem in [116]". 

With this background, Asser's aim is to give a purely arithmetical char- 
acterization of spectra. This is done via an arithmetical encoding of finite 
structures, first-order sentences and satisfiability. Let us make precise 
Asser's construction. 

Note that in the sequel "characteristic functions" (of sets or relations) 
are not taken in the usual way: a unary function / is said to be the char- 
acteristic function of the set of integers n such that f(n) = 0. It is only a 
technical matter to come back to the usual definition with little machin- 
ery, for instance use x( n ) = 1 — f( n ) (so-called modified substraction i.e. 
x— y = x — y if x > y and otherwise). Using this alternative definition, 
characteristic functions are not required to be 0-1-valued. 

W.l.o.g., let if be a sentence in relational Skolem normal form, i.e. 
(p = \/x\ . . . \/x r 3x r+ i . . . 3x s ip(xi, . . . x s ), where ip(xi, . . . x s ) is a Boolean 

combination of atomic formulas (xj 1 , . . . Xj a . ) with i = 1, . . . , t and 
of atoms x/j = x; 2 . Assume that ip contains u different atoms of type 

R^ ll \xj 1 , . . . Xj a ) and v different atoms of type x\ x = xi 2 . Let ^ : 
{0, 1} U+V — > {0, 1} be the Boolean function associated to the propo- 
sitional version of ip (using the convention that encodes true and 1 
encodes false). 

Denote by Bitk(y, z±, . . . ,Zk,n) the binary digit of y of rank Yli=i z i ' 
n 1-1 , assuming y < 2 n , z\ < n,...,Zk < n. Encode the fc-ary rela- 
tion R on the domain {0, ... , n — 1} by the number y < 2 n such that 
Bitk(y, z±, . . . , Zk, n) = if and only if R(zi, . . . , z^j holds. Let S(zi, z-i) = 
if z\ = Z2 and 1 otherwise. Let \I/*(yi, . . . , yt, x\, . . . , x s , n) be obtained 
from ^ by replacing each atom R^ (xj 1 , . . . Xj a . ) by Bit ai (yi,Xj 1 , . . . , Xj a . , n) 
and every atoms x^ = xi 2 by 5(xi 1 ,xi 2 ). The first-order quantifiers 

dealt with by defining 

n— 1 n—1 n—l n—l 

***{yi,...,yt,n)= £■■■£ U . . . H **(y,x,n). 

xi=0 x r =0i r +i=0 x s =0 
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Note the non-standard use of ^ for V and Y\ f° r 3, due to the fact that 
encodes true and 1 encodes false. Finally the characteristic function 
of the spectrum of the sentence (p = Vxi . . . \/x r 3x r+ i . . . 3x s ip(xi, . . . x s ) 

2 n<ll -l i-r2 nat -l 

is x (n) = [\ yi=0 ■■ -11^=0 * (yi>-- - ,yt,n). This construction is 
clearly elementary. Conversely, it is also easy to verify that any func- 

2 n<11 -1 t-tO™"*-! 

tion defined as x(n) = U yi =o •••Ii 2/t =o * (tfi> ••• iVt,n), where 
is obtained from some Boolean function ^ by the same type of construc- 
tion, is the characteristic function of the spectrum of the corresponding 
first-order sentence. Hence we have the following result. 

Theorem A.l. A set S is a spectrum iff its characteristic function \ 

has the form X (n) = Y\ yi=0 ■ ■ .U yt =o • • • > Vu n), where *** is 

obtained from some Boolean function ^ by the above construction. 

Note that Asser judges his result "non satisfactory", in particular be- 
cause this paraphrastic characterization is of no help in proving that a 
given set is or not a spectrum, or in providing any concrete spectrum. 
However, Asser's characterization is enough to prove Theorem 14. 101 that 
we restate here for sake of self-containment. 

Theorem KM Spec c 

The inclusion follows from the fact that Theorem I A. II provides elemen- 
tary characteristic functions for spectra. The properness is obtained by 
diagonalization. 

As a conclusion, Asser asks some questions, that have essentially re- 
mained open up to now. First, he asked for a recursive characterization 
of spectra. He notes that there are actually two different problems. The 
first one asks for a recursively defined class of functions, i.e. a class of 
functions defined via some basis functions and closure under some func- 
tional operations, such that the unary functions in this class are exactly 
the characteristic functions of spectra. Second, he asks for a recursively 
defined class of functions, but now such that the unary functions in it 
enumerate exactly the spectra, i.e. a set S is a spectrum if and only 
if S = /(N) for some / in the class. Note that this is not the most 
commonly admitted meaning for enumeration, because the enumeration 
functions are usually required to be strictly increasing, which is not the 
case here. 

Next, Asser refers to "work in progress" that proves that a large class 
of unary functions are characteristic functions of spectra, among which 
the following arithmetically defined sets: prime numbers, multiples of a 
given integer k, powers of a given k, k-th powers, composite numbers. a 

Finally, the third and most famous open question proposed in this paper 
is usually known as Asser's Problem (Open Question [2|) and asks whether 
spectra are closed under complement. 
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A. 2. Mostowski's paper. A paper almost simultaneous with Asser's 
is [98], due to A. Mostowski in 1956. It also adresses recursive character- 
ization of spectra, and explicitly uses the name "spectrum" . It is noticed 
that "The results of Asser overlap in part with results which I have found 
in 1953 while attempting (unsuccessfully) to solve Scholz's problem (cf. 
Roczniki Polskiego Towarzystwa Matematycznego, series I, vol. 1 (1955), 
p. 427). I shall give here proofs of my results which do not overlap with 
Asser's." 0. 

Here, A. Mostowski defines a class of functions denoted by K as follows. 
Definition A. 2. The class K is the least class 

- containing the functions Zk,U^,S,C respectively defined by: 

- Z k (xi, ... ,x k ,n) = 

- Ut{xi, . . . , Xft, n) = min(xi,n), for i = 1, . . . , k 

- S(x, n) = min(x + 1, n) 

- C[x) = n 

- closed under composition: f(x±,... , Xj-i, g(yi, ■ ■ ■ , y p , n), Xj + i, . . . ,Xk,n) 

f(0,x,n) = g{x,n) 

f(x + 1, x, n) = min(h(x, f(x, x, n),x, n),n) 

The basis functions Z k , Ul and S are intended as the classical zero, 
projections and successor functions, but the special variable n always 
bounds their values. The function C is intended as a maximum function. 
The functional operations composition and recursion are also bounded by 
n. The main result of Mostowski's paper is the following theorem. 

Theorem A. 3. For any unary function f £ K , the set {n + \ \ f(n) = 
0} is a spectrum. 

Let us give an idea of the proof via an example. Consider the functions 
/, g and h defined as follows: 

- f(x, n) = 1 if x = and fix, n) = otherwise. 
~ f(0,n) = S(Z(C(n),n),n) 

f(x + 1, n) = min(Z(f(x, n),n),n) 
g(x, n) = if x is even and g(x, n) = 1 otherwise. 



closed under recursion: 



I.e. 



I.e. 



g(0,n) = Z(C(n),n) 
g(x + 1, n) = min(f(g(x, n),n),n) 
- h{n) = g(C(n),n). 
Clearly we have h £ K and h(n) = if and only if n is even. Let us derive 
from the definition of h a sentence ip in the vocabulary 

a = {<, min, max, Succ®, Ilf* ,R^,B^} 



10 Thanks to J. Tomasik, we have seen a translation of the Polish reference. It is the 
abstract of a seminar given by Andrzej Mostowski on October, 16. 1953. In addition to 
the following material, it is also stated that spectra form a strict subclass of primitive 
recursive sets, a result which indeed overlaps with Asser's. 
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such that ip has a model with n + 1 elements if and only if h(n) = (i.e. 
SPEC(^) is the set of odd numbers). The key point of the construction is 
that the functions in the class K can be interpreted as functions on finite 
structures (eg. from {0, . . . , n} k to {0, . . . , n}) without loss of generality, 
because of the special variable n that bounds all their values. 

The sentence -0 first expresses the fact that < is a linear ordering, min 
and max are its first and last elements and Succ its successor relation. 
Then, ip describes the behavior of the predicates Rf, R g and corre- 
sponding to the graphs of the functions /, g and h. For instance, R g 
obeys the conjunction of the following sentences: 

- g is functional in its first variable: 

Vx, y (y = max — ► 3\zR g (x, y, z)) 

- the second variable in g is always n: 

Vx, y, z (R g (x, y,z) — > y = max) 

- description of the base case of the definition of g: 

\/x, y, z [(Rg(x, y, z) A x = min) — ► z = min] 

- description of the recursive recursive case of the definition of g: 

Vx, y, z[(R g (x, y, z) A -*x = min) 
— > 3t, u(Succ(t, x) A R g (t, y, u) A Rf(u, y, z))] 
Our goal is then achieved by adding to ip the following condition: 

Vx, y (R^x, y) — ► (x = max A y = min)) 
Finally, it is clear that SPEC('0) is the set of odd numbers as required. 
Mostowski asks if the converse is true, i.e. 

Open Question 38. Is every spectrum representable as {n+1 \ f(n) = 
0} for some function f € K? 

No answer is known up to now. 

As a conclusion, new examples of spectra are presented: the set of 
integers having the form n\ for some n, and the set {n \ n 2 + 1 is prime }. 
Also, Mostowski asks whether Fermat's prime numbers, i.e. primes of 
the form 2 2 +1, form a spectrum. This question can be understood in 
two different ways, as noticed by Bennett: which one of the sets A = 
{p | p is prime and p = 2 2 " + 1 for some integer n} and B = {n \ 2 2 " + 
1 is prime } is intended ? Using rudimentary relations, the set A is easily 
proved to be a spectrum, whereas it is still not known for the set B. 

Finally, let us remark that it is ordinarily considered that what Mostowski 
proved is that the unary relations in £ 2 are spectra. This is not exactly 
the case, but the legend is most probably due to the fact that Bennett at- 
tributes this result to Mostowski. However, Bennett also notes that, even 
if it is easy to prove that K C £ 2 , it is not clear that the bounded version of 
any function in £ 2 (i.e. fb(x\, ■ ■ ■ , x^, n) = min(f(xi, . . . , x^), n)) is in K. 
Mostowski's construction crucially relies on the fact that the functions in 
K are bounded by their last variable, and does not generalize to functions 
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in £ 2 . In contrast, it is not difficult to verify that the bounded versions 
of addition and multiplication are in K, and consequently that the rudi- 
mentary relations have their characteristic functions in K. Whatever, it 
is true that the unary relations in are indeed spectra, see Corollary 

A. 3. Bennett's thesis. This is a huge work titled "On spectra" [9], 
but which also deals with a lot of other subjects. Bennett's thesis is un- 
published, and only available via library services. It is one of the remark- 
able early texts anticipating later developments in finite model theory, 
definability theory and complexity theory. It contains a characterization 
(and various definitions) of rudimentary sets and already relates spectra 
to space bounded Turing machines, thus catching a glimpse of many of 
the results concerning spectra that were formulated and proved in more 
modern language after 1970. 

Not only first-order spectra are considered by Bennett, but also spectra 
of higher order logics, and not only sets, but also many-sorted sets, all in 
all spectra of the whole theory of types. This full generality makes the 
notations quite clumsy. The use of many-sorted structures corresponds 
to relations of arity greater than one, and the use of higher order logics 
provides more complicated relations. 

We shall limit ourselves with the cases of one-sorted (i.e. ordinary) 
spectra of orders one and two. Note that the first item of Theorem IA.4I 
is also stated as Theorem 14. 131 in Subsection 14.31 

Theorem A.4 (Bennett, 1962 [9]). 

(i) A set S C N is a first-order spectrum iff it can be defined by a 
formula of the form 3y < 2 xl R(x,y) for some j > 1, where R is 
strictly rudimentary. 

(ii) A set S C N is a second-order spectrum iff it can be defined by a 
formula of the form 3y < 2 x3 R(x,y) for some j > 1, where R is 
rudimentary. 

Spectra of higher order are characterized by similar features: spectra 
of order 2n correspond to rudimentary relations prefixed by an existen- 

tial quantifier bounded by an iterated exponential 2 with height n, 
and spectra of order 2n — 1 correspond to strictly rudimentary relations 
prefixed by an existential quantifier bounded by an iterated exponential 
with height n. Spectra of sentences over a d-sorted universe have the same 
types of characterizations, using By < 2 max ( a:i >- > x d) 3 R{x\, . . . ,Xd,y)- Fi- 
nally, the spectra of the whole type theory are characterized as the ele- 
mentary relations. 

Bennett also introduces several other subrudimentary classes, respec- 
tively called "strongly", "positive" and "extended" rudimentary relations, 
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which yield a bunch of slightly different characterizations of spectra, which 
may witness various unsuccessful attempts to design a truly satisfactory 
characterization. In this survey, we shall limit ourselves to Rud and 
Srud. 

Some consequences of the characterization theorem (not all of them are 
immediate) : 

Corollary A. 5. 

(i) For each n > 1, the class of spectra of order n is closed under A, 
V, bounded quantifications, substitution of rudimentary functions, 
explicit transformations and finite modifications. 

(ii) For each n > 1, the class of spectra of order 2n is closed under 

(iii) The class of first-order spectra contains the rudimentary relations 
and 6+ . 

(iv) The class of second-order spectra strictly contains the rudimentary 
relations. 

(v) For each n > 1, spectra of order n form a subset of spectra of order 
n + 1 and a strict subset of spectra of order n + 2. 

We propose below a proof of Bennett's theorem. 

PROOF of Theorem IA.41 dTTJ) We first present the second-order case, 
because it has less technical difficulties. 

- First inclusion: {spec(v?) | p> G SO} C {By < 2 xJ R(x,y) \ j > 
1 and R G Rud} i.e. p has a model with x elements iff 3y < 2 xJ R{x, y) 
is true. 

W.l.o.g. we may assume that p> has no first-order or second-order free 
variable (just quantify existentially in case there are any). Assume the 
second-order variables appearing in <p have arities strictly less than j. 
Then we take y = 2 xJ . We encode a second-order variable Z with arity 
a < j by the number z < 2 X < y in the usual way. Hence, every second- 
order quantification QZ^ in p is translated into the first-order bounded 
quantification Qz < 2 X < y. Recall that Bit(a, b) is true iff the bit of 
rank b of a is 1. Now, every atomic formula Z(z\, . . . ,z a ) is translated 
into Bit(z, z\ + Z2 ■ x + . . . + z a ■ x a_1 ). Every first-order quantification 
qz in <p is translated into the bounded quantification qz < x. The atomic 
formulas z = z' in <p remain unchanged. Let p' denote the obtained 
formula. Finally, let R = (y = 2 xJ ) A p' . 

- Second inclusion: {spec^) | p G SO} 5 {3y < 2 xl R(x,y) \ j > 
1 and R G Rud} i.e. 3y < 2 xJ R(x, y) is true iff <p> has a model with x 
elements. 

First, we use three existentially quantified relations, namely <( 2j ) which 
is bound to be a linear ordering over the j-tuples of vertices, +( 3j ) which 
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is bound to be the associated addition and x( 3j ) which is bound to be 
the associated multiplication. Let us denote by Arithm(<, +, x) the first- 
order sentence expressing this requirement. Note that we may now use for 
free any usual arithmetic predicate on numbers bounded by x 3 (written 
in x-axy notation, i.e. seen as j-tuples of integers in {0, . . . , x — 1}). Next, 
all variables in R, including x and y, are encoded by j'-ary second-order 
variables in ip in the usual way. For instance if x = Ym=o ^ h > we ^ 
X = {(i ,0,... ,0),... ,(v,0,... ,0)}. ' ' 

W.l.o.g. we may assume that all the atomic formulas in R are of type 
u • v = w (concatenation), which we translate into 

Con_cat(U, V, W) = 

3tV(t) A Vz(V(z) — ► z < t) A Vz(U(z) — ► z < (max - t)) 
AMz(W(z) < ► ((z <t A V(z)) V (z>tAU(z-t)))) . 

Note that this sentence would be cleaner in dyadic than it is in binary, 
but the whole encoding would also be more complicated because two 
unary relations are needed to encode an integer in dyadic (the set of Is 
and the set of 2s) because its length is fixed. 

In order to translate the bounded quantifications in R, we also need the 
following first-order sentence, which expresses the fact that the integers 
u and v respectively encoded by U and V are such that u < v. 

Smaller(U, V) = 

3z (V(z) A -^U{z) A VV > z^U(z')) 

V3z (V(z) A U(z) A VV > z (-^Viz 7 ) A -U(?)) A 37 <z (Viz 7 ) A -U(?))) . 

Now, let R' be obtained from R by applying the following rules: every 
bounded first-order quantification \/z < z' . . . is translated into the sec- 
ond order quantification MZ^ Smaller (Z, Z 1 ) — ► and accordingly 
for 3z < z' . . . ; and every atomic formula u ■ v = w is translated into 
Concat(U,V,W). 

It remains to express that X encodes the size x of the domain, which 
is done using the binary notation of (max, 0, . . . ,0), which represents the 
largest element of the domain. More precisely, we have max + 1 = x, 
which translates in binary as follows: 

Dom(X) = 

Vz(((X(z) AV? < z-iX(j?)) — ► ->Bit((max,0, . . . ,0),z)) 
A((37< zX(7)) — >JX(z) < — >_Bit((max,0,... ,0),z))) 
A((37> z(X0) AVi 77 <i 7 ^X(? 7 ))) — > Bit {{max, 0, . . . ,0),z))) 

Finally, ip is 3 <^ 3+ (3 ^3x^3Y^3X^(Arithm(<,+, x)ADom(X)A 
R>). 
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(jl|) Next we turn to the first-order case. We consider the proof of the 
second-order case and show how it has to be modified in order to fit to 
the first-order case. Note that the proof is now more tricky, and we use 
dyadic notation because we have to be more precise. 



- First inclusion: {spec(</?) | <p € FO} C {3y < 2 X R{x,y) \ j > 
1 and R € Srud} 

The main difference concerning (p is that it contains no second-order 
quantifications. Concerning R, we have to deal with two differences: 
bounded quantifications are now replaced by part-of quantifications (Vzi \ 
Z2 and 3z\ \ Zq) on the one hand and we have to use concatenation instead 
of arithmetic on the other hand. 

However, ip does contain free second-order variables, say z[ ai \ . . . , zj? k ' , 
which we do not encode in the usual way because Srud does not allow 
to use arithmetical predicates, hence the Bit predicate is not available. 
Instead, we assume for now that the alphabet is {1, 2, *, *, •} and we first 
define a provisional predicate R'(x,y). We shall explain later how to get 
rid of the extra symbols *, * and • to obtain the expected R(x,y). 



We use the following encoding: if Z = {(x\, . . . , x a ), . . . , {x\, . . . , Xa)}, 
with p < x a , then let z = * * x\ * . . . * x\ * * . . . * *x\ * . . . * x p a * *. Note 
that we have \z\ < x a ■ a ■ (\x\ + 2). 

Let us define xq = *x * (x — 1) * . . . * 1*, i.e. the dyadic representation 
of xq is the concatenation of the dyadic representations of all integers in 
{1, . . . ,x}, separated by *s. Note that \xq\ < x ■ (\x + 2|) < x 2 . Finally, 
let y = »z\ • . . . • Zj~ • xq*. Clearly we have y < 2 xJ for some j > 1. 

Now, R'(x,y) will begin with 3z\ \y ... 3zk \ y 3xq \y ((y = *Z\ • . . . • 
Zk • xq») A ->(• \ Zi) A . . . A -<(• \Zk) A \xq) A . . . ), in order to retrieve 
the significant parts of y. 

We use xq to replace every first-order quantification Vn . . . appearing 
in (p by a part-of quantification Vn \ xq (Int(u, xq) — > . . . ) in R', and 
similarly for 3u . . . , where Int(u, xq) means that u is a maximal non- 
empty string of Is and 2s in xq. The most technical part of the proof is to 
write a strictly rudimentary formula Doiti(xq, x) which is true iff xq has 
the expected form, but for sake of brevity, we do not explicit this formula. 
In particular, note that we now consider the domain as {1, . . . , x} instead 
of {0, . . . , x — 1} as we did previously. Finally it is not difficult to write 
a formula Verif(xQ, z) expressing the fact that z has the expected form 
** x\* . . . * x^ *★...★ *x\ * . . . * x p * *. Namely, take 
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Verif(xo, z) = 3u\ z 

(*u* \z) A V-u \z(((*u* \z) A fit) A u ^ e) — > 3v\ \u . . . 3v a \u 
(Int(vi,xo) A ... A Int(v a , xo) A u = *v\ * ... * v a *)) 
AVui, i*2, a, (3, 7 |"2(((<z = o;*iti*/9*it2*7V5; = a*i*i*U2*7) 
A-.(* \ui) A -■(★ fu 2 )) — ► i*i / u 2 ). 

There are two types of atomic formulas in ip: equalities z\ = Z2 and 
atoms Z(z\, . . . , z a ). Equalities remain unchanged and Z(zi, . . . , z a ) is 
changed into * * z\* . . . * z a * *\ z. These operations lead to the strictly- 
rudimentary formula ip' . 

Finally, R'(x, y) is 3zi \ y . . . 3z k \ y3x \ y((y = *z\ • . . . • z k • x ») A 
->(• \ zi) A ... A ->(• \z k ) A ->(• \xq) A Dom(xo,x) A Verif(x ,Zi) A ... A 
Verif(x ,z k ) A ip'). 

To obtain R, it remains to get rid of the alphabet {1, 2, *, *, •}. Let * be 
a string of Is which is not a subword of x, x — 1, . . . ,2 and 1. For instance, 
* could be of length \x\ + 1. Let * = 2 * 2 and • = 22 * 22. The final length 
of y is polynomially longer than it used to be, which remains acceptable. 
Finally, take R{x,y) = 3*\y 3*\y 3»\y ((Vn \ * (u = 1)) A * ^ e A ★ = 
2*2A» = 22 * 22 A R'). Note that strictly rudimentary relations do not 
define predicates referring to the length of integers, so that * cannot be 
bound to be some specific word like ll x l +1 . 

- Second inclusion: {spec(i^) | (p G FO} 5 {3y < 2 xJ R(x,y) \ j > 
1 and R G Srud} 

The main difference with the second-order case concerning ip is that 
it only contains first-order quantifications. However, we are still free to 
choose as many free second-order variables as we may need. In particular, 
we still use usual arithmetic predicates on the (j-tuples of) elements of 
the domain, and the previous first-order sentence Arithm(<, +, x) is still 
required to hold for this purpose. In addition, we introduce the second- 
order variables X±, X2 and Y\, Y2, both of arity j, respectively representing 
the set of positions where x and y have Is and 2s and no other second-order 
variables are introduced. Let Word(Xi, X2) be the sentence expressing 
the fact that X\ and X2 (and similarly Y\, Y2) do represent a dyadic word, 
namely 

Word(X 1 ,X 2 ) = Vz-i(Xl(z) A X 2 {z)) 

A3zit({t >z — ► (-1X1 (t) A ^X 2 (t))) A (t < z — ► V X 2 (t)))). 

Concerning R, we may assume w.l.o.g. that it only contains part-of quan- 
tifications qz \x and qz \y and no qz \z' for z' {x,y}. 

The main trick is that a part-of quantification 3z\y . . . (for instance) 
will be replaced by 2j first-order quantifications 3~z~i3z~2 (zi < Z2 A . . . ), 
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where ~z\ and ~Z2 encode the positions where z begins and ends, as a 
subword of y. 

We have to translate the atomic formulas u ■ v = w. W.l.o.g. we may 
rewrite R in an equivalent formula by replacing everywhere u-v = w with 
(u-v = w Au\y Av \y Aw \ y)V (u-v = w Au\y Av \ y Aw \ x)V . . .V (u-v = 
wAu\xAv\xAw\x). Hence, there are 8 slightly different cases to be 
taken care of. We limit ourselves with the case u-v -w Au\y Av\y Aw\y. 
The corresponding formula Concat yyy (ui, vq, v~i, vo~, wT,W2) is as follows: 

W^ = Wl + U2-Ul+V2-Vl 

AVz(W[ < z < Wi + u 2 — ui — ► 
((Y 1 (z)_^Y 1 (z- wp) Aiy 2 {z) <— Y 2 (z - wi)))) 
AVz(W[ + U2—Ul<Z< W2 — ► 

{(Yi(z) < — ► Yi(z - W[ - m + ui)) A (Y 2 (z) < — > Y 2 (z - W[ - + ul)))). 
Let us denote by R' the obtained sentence. 

The last remaining part is to write out a sentence Dom'(Xi,X 2 ) ex- 
pressing the fact that X\ , X 2 encodes (in dyadic) the cardinality of the 
domain, i.e. the successor of the j-tuple (max, 0, ... ,0). This is a bit 
more technical than the sentence Dom(X) we used for the binary no- 
tation and we do not spell it out here. Finally, take ip = Arithm A 
Word(X 1 ,X 2 ) A Word(Y u Y 2 ) A Dam(X 1 ,X 2 ) A R'. H 

Connections with complexity classes. At the beginning of complexity 
theory, the usual compexity classes such as the polynomial hierarchy had 
not emerged yet. So the classes used by Bennett are not standard ones. 
He considers two hierarchies based on space-bounded deterministic Tur- 
ing machines defined in a recursive fashion: the base class is of type 
FDSpace(f(n)), and the next class has a space bound which is a func- 
tion in the previous class. 

Let us denote by (JZ l )i>i the first hierarchy, introduced in Ritchie's 
1963 paper [110], which comes from from his Ph.D. thesis [109]. 

Definition A. 6 (Ritchie's classes). 

- Let 1Z 1 be the class of functions computable by some (determin- 
istic) Turing machine in space bounded by b • max(x > ) on input 
it, where b > 1 is some integer fixed for each machine, i.e. 1Z 1 = 
FDSpace(0(2 n )) in modern notation. 

- For each i > 1, let us denote by 1Z l+l the class of functions com- 
putable by a Turing machine in space bounded by B(Hf), where B 
is some function in 1Z l , fixed for each machine. 

- For each i > 1, let us denote by Tl\ the class of relations whose 
characteristic functions are in W . 
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It is proved in [110] that this hierarchy (JZ\)i>i is strict and that its 
union corresponds to elementary relations. 

Using the same pattern, Bennett introduces a second hierarchy, that we 
denote by {B l )i>\. 

Definition A. 7 (Bennett's classes). 

- Let B 1 be the class of functions computable by some (deterministic) 
Turing machine in space bounded by P(~a?) on input H?, where P 
is some arithmetical polynomial fixed for each machine, i.e. B 1 = 
FDSpace{{2°^)) in moder n notation. 

- For each i > 1, let us denote by B t+l the class of functions com- 
putable by a Turing machine in space bounded by B(lt), where B 
is some function in B l , fixed for each machine. 

- For each i > 1, let us denote by Bl the class of relations whose 
characteristic functions are in B l . 

Bennett shows that Ritchie's classes 1Z\ come in between spectra of 
various orders, but not in a very nice way. In contrast, he proves nice 
closure properties and an exact intercalation between the classes of spectra 
of consecutive orders for the classes B\ . However, all these classes are too 
big to be informative concerning relationship between first-order spectra 
and complexity classes. 

In order to state the next theorem, let us denote by S l the class of 
(many-sorted) spectra of formulas of order i. For instance Spec is the 
class of unary relations in S 1 . 

Theorem A. 8. 

(i) Kl C S 3 and for each i > 2, S 2i ~ 2 C 7% C S 2i+1 . Moreover, for no 
i,j>l does 1Z\ = 

(ii) For each i > 1, S 21 C B\ C S 2l+1 (equality or strictness is un- 
known) and 1Z\ C B\ C IZ 1 ^ 1 . Moreover, Bl is closed with respect 
to union, intersection, bounded quantifications, substitution of rudi- 
mentary functions, explicit transformations and finite modifications. 

The proof of item (JTJ) is based on recursive characterizations of the 
classes TZ\ , whereas item ([n]) is stated without proof. 
A. 4. Mo's paper. 

There is a late paper on the recursive aspect of spectra, namely [95], due 
to the Chinese logician Mo Shaokui in 1991, only available in Chinese (see 
the author's English abstract in Mathematics Abstracts of Zentralblatt 
[94]). With the help of Zhu Ping [133], we have been able to state Mo's 
result, and we propose a proof sketch. 

Definition A. 9. Let x,xi,X2, ■ ■ ■ and y, yi, jj2, ■ ■ ■ be two disjoint sets 
of variables. Let Mo be the smallest class of predicates over integers 
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containing the relations x\ + X2 = £3, x± x X2 = X3 (both for variables 
of type x only) and Bit(y, x) (where the first variable is of type y, the 
second of type x) and closed under Boolean operations and (polynomially) 
bounded quantifications for variables of type x only. 

Note that a predicate in Mo has two types of variables, which do not 
play similar roles, and that Mo extends the rudimentary relations by the 
use of Bit(y, x) atoms, which are not definable because y variables are 
not allowed in the atomic formulas for addition and multiplication. 

Theorem A. 10. {spec(</>) | <p e FO} = {3 Vl < 2 xh ...3y k < 2 xjk 
R(x,yi, ... ,y k ) I k,ji, . . . , j k > 1 and R £ Mo} 

Proof. It is a slightly modified version of the proof of Bennett's the- 
orem for second-order spectra. 

- First inclusion: 92 has a model with x elements iff 3y\ < 2 xJ1 . . . 3y k < 
2 xJk R(x,yi, ... ,y k ) is true. 

We encode a predicate symbol Y with arity j by the number y < 2 xJ in 
the usual way. Hence, every atomic formula Y(x\, ... , Xj) is translated 
into 3x' < xi(Bit(y, x') A x' = x\ + X2 ■ x + . . . + Xj • x^ 1 ). Every first- 
order quantification qx,i in if is translated into the bounded quantification 
qxi < x. The atomic formulas x\ = X2 in <p remain unchanged. Let R 
denote the obtained formula with free variables x, yi, . . . ,y k - 

- Second inclusion: 3yi < 2 xn . . . 3y k < 2 xJk R(x,y\, . . . ,y k ) is true iff 
ip has a model with x elements. 

First, we use three predicate symbols, namely <( 2 ) which is bound to be 
a linear ordering on the vertices, +® which is bound to be the associated 
addition and x ( 3 ) which is bound to be the associated multiplication. Let 
us denote by Arithm\{<,+, x) the first-order sentence expressing this 
requirement. Note that we may now use for free any usual arithmetic 
predicate on numbers bounded by x. 

Next, every free variable of type y in R and bounded by 2 x3 is translated 
into a predicate symbol Y of arity j. 

W.l.o.g., we may assume that all the bounded quantifications in R are 
of type qx' < x l for some i > 1. The bounded quantification qx' < x % in 
R is simply translated into qx' x . . . qx[ and x' is represented by the i-tuple 
(x[, . . . , x'j). 

There are three types of atomic formulas in R. Let us first consider 
formulas x\ + X2 = X3 and x\ x X2 = X3. Assume we have x\ < x %1 < 
x J ,X2 < x' 12 < x 3 ,X3 < x' 13 < x 3 , with j = max(ii, 12, 13). The variables 
x\, X2, X3 correspond to the tuples (x\, . . . , x\) , (x\, . . . , x 3 2 ), {x\, . . . , x^) 
(padding with as many 0s as necessary) . This includes the case x < x 2 so 
that x corresponds to (0, 1,0,... , 0). Then x\ + X2 = x% is changed into 
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Addj(x\, . . . , x\,x\, . . . , x J 2 , x\, ■ ■ ■ , X3) and x\ x X2 = X3 is changed into 
, arj, x\, . . . , X2, x\, . . . , X3), where the formulas Addj and 
Multj express addition and multiplication on j-tuples in x-ary notation. 
The case of atomic formulas Bit(y, x') is dealt with similarly. Assume we 
have y < 2 xJ , and x' < x l then there are three possibilities. If i < j, then 
Bit(y, x 1 ) is changed into Y(x' 1 , ... , x-, 0, . . . ,0). If i = j, then Bit(y, x') 
is changed into Y(x' 1 ,... ,x'j). If i > j, then Bit(y,x') is changed into 
Y(x' t , . . . , x'j) A Xj + i = A . . . A Xi = 0. Similarly, Bit(y, x) (which may 
also occur in R because a; is a free variable of type x) translates into 
Y(0, 1,0, .. . ,0) if Y has arity 2 at least and into 0/0 (false) if Y is 
unary. Let us denote by R' the first-order sentence thus obtained. 
Finally, cp is Arithmi(<, +, x) A R' . H 

Note that, in order to uniformize the proofs with that of Bennett's theo- 
rem and help comparison, we have slightly modified the original statement 
in two points. First, Mo uses functional vocabularies, which yields bounds 
of type x xJ for y type variables and the use of atoms Digit x (y,x') = x" 
(meaning "the digit of rank x' of y in x-ary notation is x"") instead of 
Bit(y,x). Second, the relation R is originally described using Grzegor- 
czyk's classes £®, E\ or instead of Rud. 

Finally, concerning Asser's problem (so-called second Scholz problem 
here), the author's abstract [94] asserts that: 

It is also shown that if all the functions in £ can be enumerated 
by a function in £ 2 , then the complement of a certain finite 
spectrum cannot be any finite spectrum. Hence, under such a 
condition, the answer to the second Scholz problem is negative. 

Hence, the conditional negative solution proposed here seems to be linked 
to some separation of £ and £ 2 via diagonalization, which seems unlikely 
(the classical proof of separation of £ % and £ l+l uses the bound on the 
growth of the functions in £ l ). 

Appendix B. A compendium of questions and conjectures. 

In this appendix we list, for convenience, all the Open Questions (OQ)and 
stated in our survey. 

From Section^ 

OQ [1} (Scholz) Characterize the sets of natural numbers that are first 
order spectra. 

OQ [2} (Asser) Is the complement of a first order spectrum a first order 
spectrum? 
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OQ [3} Is the complement of a spectrum of an MSOL-sentence again 

a spectrum of an MSOL-sentence? 
OQ [4) (Fagin) Is every first order spectrum the spectrum of a first 

order sentence of one binary relation symbol? 
OQ [5} (Fagin) Is every first order spectrum the spectrum of a first 

order sentence over simple graphs? 
OQ [6} (Fagin) Is every first order spectrum the spectrum of a first 

order sentence over planar graphs? 



From Section^ 



We recall a few definitions: Let M C N + , and let mi , m,2, ... an enumer- 
ation of M ordered by the size of its elements. Xm(^) is the characteristic 
function of M. rjM(n) is the enumeration function of M, i.e., r]M(n) = m n 
if it exists, and r)M{n) = otherwise. Finally, Sm(ti) = t?m(w+1)— 

OQ [3 Which strictly increasing sequences of positive integers, are 
enumerating functions of spectra? For instance, how fast can they 
grow? 

OQ [8} If M is a spectrum how can 8m{ji) behave? 

OQ[9} Let 7r(n) be the counting function of the primes, and let li{n) 
be its approximation by the integral logarithm. Define 
7r + = {n : ir(n) — li(n) > 0} and tt~ = {n : 7r(n) — li(n) > 0}. 
Are the sets tt + and 7r~ spectra? 

OQ HOt Let ip a first order sentence, and be the associated labeled 
counting function that is monotonically increasing. Is there a first 
order sentence tp such that for all n we have /^(n) = r/^,(n)? 

OQ lilt Are there any irrational algebraic reals which are spectral? 

OQ I13t Is every automatic real a spectral real? 

The binary string complexity of a real in binary presentation is the func- 
tion p r {m) which counts, for each m the number of distinct binary words 
w of length m occurring in r. 

OQ I14t Does the binary string complexity p r (m) of a spectral real r 
satisfy 

hminf^M<l 



or even 



hminf^ = l? 



OQ I15t Are the 6-adic £ 2 -computable reals £ 2 -Cauchy computable? 
OQ I16t Is the inclusion Ti ow C £ 2 proper? 
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From Section [^j 

OQ [13 Are the inclusions in £° C S\ C ^ proper : 
OQ I18t Additionally, is the inclusion Rud C E® proper? 
OQ I19t Is the inclusion in 6+ C Spec proper? 
OQ [20) Is the inclusion Rud C Spec proper? 

From Section^ 

OQ[lD 

(i) Are any of the inclusions 

L C NL, LINSPACE C NLINSPACE, P C NP and E C NE 
proper? 

(ii) Do any of the equalities NP = coNP and NE = coNE hold? 
OQ [52} Are the inclusions L C Rud = LTH C LINSPACE proper? 
OQ I23t Is every spectrum the spectrum of a categorical sentence ? 
OQ I24t Is there a universal (complete) spectrum So and a suitable 

notion of reduction such that every spectrum S is reducible to So? 
OQ[25} Is the inclusion d > 1, NTIME(2 d n ) C SPEC d (+) proper? 
OQ I26t Is there a characterization as a complexity class of the classes 

F-SPEC d for all d > 1 ? 

From Section^ 

OQ I27t Is every first order spectrum in SPEC2? 
OQ I28t Is the following hierarchy proper: 

F-SPECJ C F-SPEcf C F-SPEC^ C . . . C F-SPEC^ C . . .? 

OQ [29J Is Rud = F-SPEcf ? 

OQ 0MB Is SPECg = f-spec-l or even SPEC2 = F-SPEcf ? 
OQ I31t Is the following hierarchy proper: 

SPECx C SPEC 2 Q SPEC3 C . . . C SPEC fc c . . .? 

The same question may be asked for spectra over z-ary functions. 
OQ 13 2 1 Does the hierarchy SpFOL k collapse at level 3? 

From Section^ 



OQ I33t (Ash's constant conjecture) 

Is it true that for any finite relational vocabulary a and any positive 
integer k, the Ash function N a ^ is eventually constant? 



FIFTY YEARS OF THE SPECTRUM PROBLEM 



71 



OQ 13 4 1 (Ash's periodic conjecture) 

Is it true that for any finite relational vocabulary a and any positive 
integer k, the Ash function is eventually periodic? 

OQ I35t (Ultra- weak Ash conjecture) 

Is it true that for any finite relational vocabulary a, for any positive 
integer k and for all i G N + , the set N~ k (i) is a spectrum? 

OQ I36t (Ultra- weak Ash conjecture for classes of structures) 

Is it true that for any finite relational vocabulary a, for any first- 
order ex-sentence for any positive integer k and for all i G N, the 
set N Modj{y),S) is a spectrum? 

From Section^ 

OQ I37t What is the complexity of checking whether a r-structure 21 
has patch- width at most k, for a fixed kl 

From Section\^ 

O Q38t Is every spectrum representable as {n + 1 | f(n) = 0} for some 
function / G Kl 

Recall that K is the class of functions defined in Section [A] Definition 
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